log-triage¶

log-triage is a Python toolkit that sits between your log collector and an LLM. It reads log files, applies configurable rules to group and classify entries, and emits structured findings and payloads you can forward to a model. The CLI and Web UI share the same YAML configuration so you can run batch jobs, follow live streams, or explore results in a dashboard.

Why use log-triage?¶

Pluggable pipelines. Define multiple pipelines, each with its own grouping strategies and classifiers.
Flexible modules. Decide whether a module runs once (batch) or tails a file with rotation awareness (follow).
Ignore noisy lines. Drop known noise via ignore_regexes before counting errors and warnings.
Pluggable classifiers. Swap between built-in regex or rsnapshot heuristics, or register your own (see Classifiers).
Severity-aware. Findings are labeled WARNING, ERROR, or CRITICAL, and can be escalated when anomalies are detected.
LLM-ready payloads. Generate concise payloads with prompt templates, without coupling to a specific provider.
RAG-enhanced analysis. Automatically retrieve relevant documentation from knowledge bases to provide more accurate, context-aware AI responses with citations.
Web dashboard. Explore findings, edit configuration, and tune regexes in a dark-mode UI.
Alerts and storage. Send webhook or MQTT alerts, and persist findings in SQLite or Postgres for the Web UI.

What you need¶

Python 3.10+
Access to the log files you want to analyze, or a collector such as Fluent Bit writing to disk
Optional: credentials for your preferred LLM provider and database

Installation¶

Create a virtual environment and install the package (including the Web UI, alerts, and RAG extras):

python3 -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install ".[webui,alerts,rag]"

RAG Dependencies¶

The rag extra includes dependencies for Retrieval-Augmented Generation: - sentence-transformers for document embeddings - chromadb for vector storage and semantic search - GitPython for repository cloning and management - markdown for document processing

If you don't need RAG functionality, you can install without it:

pip install ".[webui,alerts]"

Next steps¶

Read the Getting started guide to run your first module.
Explore Configuration to see how pipelines, modules, and prompts fit together.
Visit Web UI to learn how to browse findings and edit the config from the dashboard.
Check out the RAG Quick Start Guide to enable AI-powered analysis with documentation context.
See the full RAG documentation for advanced configuration and troubleshooting.

Security disclaimer concerning the Web UI¶

log-triage is intended for trusted, internal environments and should not be exposed directly to the public internet. The Web UI currently has several security gaps that make it unsuitable for untrusted networks:

No CSRF protections: Cross-site requests can change configuration if an administrator is logged in.
Weak session handling: Sessions rely on a static, HMAC-only cookie with no rotation or expiry; the default secret_key of CHANGE_ME makes forgery trivial in default deployments.
Insecure cookie attributes: Login and session cookies are not limited to HTTPS or restrictive SameSite policies, increasing the risk of theft on shared networks.
Fragile IP allowlisting: Allow lists compare request.client.host strings directly and ignore proxy headers or CIDR ranges, so bypasses are likely behind proxies.
Unvalidated editors: Configuration file editor accepts arbitrary content beyond basic YAML parsing, enabling malicious injections that redirect log ingestion to attacker-controlled files or endpoints.

Run the UI only on private networks, behind strong network controls, and with a unique, secret secret_key until these issues are addressed.