Detect LLM hallucinations via geometric drift (0.9 AUC, 1% overhead)
Detects hallucinations via latent space geometry instead of text analysis, but 54% detection rate is incomplete.
Self-hosted API monitoring that tells you what to do, not just what happened
Rate-of-change detection catches degradation before P95 thresholds fire.
Backend engineers running high-stakes FastAPI services
Datadog · Sentry · Honeycomb
Tracks P95 latency per route template (e.g. /users/{id}) Learns a baseline dynamically from recent traffic Detects spikes using rate-of-change (not just static thresholds) Computes a 0–100 health score with trend direction (improving / stable / degrading) Stores events in Redis Streams for replay and debugging
One interesting result: In synthetic load tests (gradual latency ramp from ~200ms to ~1200ms over 60 seconds, with a P95 warning threshold at 1000ms), rate-of-change detection consistently surfaced degradation slightly before static threshold alerts. The window is small, but it was often enough to notice system stress before crossing alert thresholds.
Design constraints:
Near-zero overhead on the request path (async, fire-and-forget writes) Must fail silently if Redis is unavailable No external monitoring stack required (runs in-app)
Example usage: pythonapp.add_middleware(RequestMetricsMiddleware, alert_engine=engine)
Context: This is part of a larger system I'm building that integrates cloud services with mobile money APIs (EcoCash, etc.), where partial failures and latency spikes are common. Still early — hasn't been tested under real production traffic yet.
Curious how others are handling early degradation detection in FastAPI or similar systems. Repo: https://github.com/Tandem-Media/fastapi-alertengine PyPI: https://pypi.org/project/fastapi-alertengine/
Detects hallucinations via latent space geometry instead of text analysis, but 54% detection rate is incomplete.
Regression tests catch cross-domain hallucinations, but prompt-based approach won't scale.
Catches cross-branch conflicts that CI misses when AI agents work in parallel.
Passive threat detection middleware for Laravel—zero risk since it never blocks, 130+ patterns logged locally.
Viewer-side spam filter with rate limiting when YouTube's native tools only help streamers.
Slowloris protection and HTTP-detection for MQTT, but competing against Mosquitto acl and other MQTT brokers' built-in auth.