GitHub Repository

Open-source evidence tooling for recorded LLM outputs

1 star · Python

Detect when an LLM silently changes behavior for the same prompt

by catarina_eng · Mar 12, 2026 · 1 point · 4 comments

AI Analysis

●● Solid · Big Brain · Solve My Problem

Cryptographic proof of AI outputs for when compliance teams ask what the model actually said.

Strengths
  • Capture adapters for OpenAI and Anthropic intercept API calls automatically at runtime.
  • Offline verification means no server dependency or network calls to validate bundles.
  • The compare command detects when the same prompt produces different model responses over time.
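
The offline verification and compare ideas above can be illustrated with a minimal sketch. It does not use the project's actual API or bundle format: the function names (`make_record`, `verify_record`, `behavior_changed`) and the record layout are hypothetical, and a shared-key HMAC stands in for whatever signing scheme the tool really uses.

```python
import hashlib
import hmac
import json


def fingerprint(text: str) -> str:
    """Return the SHA-256 hex digest of the UTF-8 encoded text."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def make_record(prompt: str, response: str, key: bytes) -> dict:
    """Build a hypothetical record: two digests plus an HMAC tag over them."""
    body = {
        "prompt_sha256": fingerprint(prompt),
        "response_sha256": fingerprint(response),
    }
    # Canonical JSON (sorted keys) so the same body always yields the same tag.
    payload = json.dumps(body, sort_keys=True).encode("utf-8")
    tag = hmac.new(key, payload, hashlib.sha256).hexdigest()
    return {**body, "tag": tag}


def verify_record(record: dict, key: bytes) -> bool:
    """Recompute the tag locally; no network call is involved."""
    body = {k: v for k, v in record.items() if k != "tag"}
    payload = json.dumps(body, sort_keys=True).encode("utf-8")
    expected = hmac.new(key, payload, hashlib.sha256).hexdigest()
    return hmac.compare_digest(expected, record.get("tag", ""))


def behavior_changed(old: dict, new: dict) -> bool:
    """True when the prompt digest matches but the response digest differs."""
    return (
        old["prompt_sha256"] == new["prompt_sha256"]
        and old["response_sha256"] != new["response_sha256"]
    )
```

Comparing exact digests flags any byte-level difference, so this sketch would also report changes caused by ordinary sampling randomness; a real tool would likely need to account for that.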
Weaknesses
  • Requires integration at the API call layer rather than working with existing logs.
  • AI audit and compliance tooling is becoming crowded with established players.
Category
Target Audience

Compliance teams, legal workflows, AI operations engineers

Similar To

Arize · WhyLabs · MLflow

Similar Projects