A test harness that blocks unsafe AI actions before execution

by celestinestudio · Mar 12, 2026 · 2 points · 0 comments

AI Analysis

●● Solid · Solve My Problem · Ship It

Deterministic policy matrices block AI agents from executing dangerous API calls.

Strengths
  • Deterministic policy matrices replace unreliable probabilistic safety filters for critical actions.
  • Audit logs create compliance trails for every denied action without extra config.
  • CLI output integrates cleanly into existing CI/CD deployment pipelines for teams.
Weaknesses
  • LinkedIn landing page creates friction instead of linking to GitHub repo.
  • Test harness framing suggests it does not integrate into live agents yet.
Target Audience

AI Agent Developers

Similar To

Guardrails AI · Lakera Guard · Open Policy Agent

Post Description

I built a small test harness that evaluates AI actions before they execute.

Rather than relying only on prompts or output filtering, the harness adds an authorization layer that decides whether each AI action should be allowed before it runs.

Each requested action is analyzed for signals such as:

• financial actions
• external communications
• data exports
• system modification
• destructive operations

Based on the detected signals and required authorization layers, the harness determines whether the action should PASS or DENY.
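A minimal sketch of what such a signal-based decision could look like, in Python. Everything here is an assumption for illustration rather than the repository's actual logic: the keyword table, the `detect_signals` and `evaluate` names, and the rule that every detected signal requires a matching granted authorization or the action is denied.

```python
from dataclasses import dataclass

# Hypothetical keyword table: signal name -> substrings that trigger it.
SIGNAL_KEYWORDS = {
    "financial_action": ("purchase", "invoice", "wire transfer"),
    "external_communication": ("email", "sms"),
    "data_export": ("export",),
    "system_modification": ("deploy", "modify config"),
    "destructive_operation": ("delete", "drop table"),
}


@dataclass(frozen=True)
class Decision:
    action: str
    signals: tuple
    required_authorizations: tuple
    verdict: str  # "PASS" or "DENY"


def detect_signals(description):
    """Return the sorted signal names whose keywords appear in the text."""
    text = description.lower()
    return tuple(sorted(
        name for name, keywords in SIGNAL_KEYWORDS.items()
        if any(keyword in text for keyword in keywords)
    ))


def evaluate(action, description, granted=frozenset()):
    """Deny unless every required authorization has been granted.

    Assumption: each detected signal requires an authorization of the
    same name. The same inputs always produce the same Decision.
    """
    signals = detect_signals(description)
    required = signals
    missing = [name for name in required if name not in granted]
    verdict = "DENY" if missing else "PASS"
    return Decision(action, signals, required, verdict)


print(evaluate("send_external_email", "Send an email to a vendor").verdict)
print(evaluate("general_information", "Summarize the release notes").verdict)
```

Because the decision depends only on a fixed table and the set of granted authorizations, it can be replayed and audited, which is the property a probabilistic filter does not give you.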

Example output:

Running 14 tests...

[1/14] financial_commitment -> DENY
[2/14] send_external_email -> DENY
[3/14] deploy_to_production -> DENY
[14/14] general_information -> PASS
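For reference, a runner that prints lines in this format could be sketched as follows. The `run_suite` function, the idea of pairing each test with an expected verdict, and the stand-in `stub_decide` policy are hypothetical and only mirror the output shape above.

```python
def run_suite(cases, decide):
    """Run each (action_name, expected_verdict) case through `decide`.

    Returns the output lines and the number of cases whose verdict
    differed from the expectation.
    """
    total = len(cases)
    lines = [f"Running {total} tests..."]
    failures = 0
    for index, (name, expected) in enumerate(cases, start=1):
        verdict = decide(name)
        lines.append(f"[{index}/{total}] {name} -> {verdict}")
        if verdict != expected:
            failures += 1
    return lines, failures


# Stand-in policy for the sketch: a fixed set of restricted action names.
RESTRICTED = {"financial_commitment", "send_external_email", "deploy_to_production"}


def stub_decide(name):
    return "DENY" if name in RESTRICTED else "PASS"


cases = [
    ("financial_commitment", "DENY"),
    ("general_information", "PASS"),
]
lines, failures = run_suite(cases, stub_decide)
print("\n".join(lines))
```

Returning the failure count lets a caller turn it into a process exit code, which is one way output like this could gate a CI job.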

Every evaluation produces an auditable record including:

• detected signals
• required authorizations
• PASS / DENY decision
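One way to picture such a record is a single JSON line per evaluation. This is a sketch under assumptions: the field names and the `audit_record` helper are hypothetical, and the repository may store records in a different shape or include more fields.

```python
import json


def audit_record(action, signals, required_authorizations, decision):
    """Serialize one evaluation as a JSON line with the three listed fields."""
    if decision not in ("PASS", "DENY"):
        raise ValueError(f"unknown decision: {decision!r}")
    record = {
        "action": action,
        "detected_signals": sorted(signals),
        "required_authorizations": sorted(required_authorizations),
        "decision": decision,
    }
    # sort_keys keeps the output byte-identical for identical inputs.
    return json.dumps(record, sort_keys=True)


line = audit_record(
    "send_external_email",
    ["external_communication"],
    ["external_communication"],
    "DENY",
)
print(line)
```

Sorting both the keys and the list values means two evaluations of the same action produce the same line, so records can be compared or diffed directly.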

The goal is to explore what a deterministic execution governance layer might look like for AI systems interacting with real environments.

Demo video walkthrough: https://www.linkedin.com/feed/update/urn:li:activity:7436787...

Repository: https://github.com/celestinestudiosllc/ai-action-authorizati...

Curious how others building agent systems or AI runtimes are approaching execution authorization.

Similar Projects

Security · ●●● Banger

VellaVeto — blocks unsafe MCP tool calls by default

Fail-closed MCP gateway with formal verification and MCPSEC benchmark suite.

Big Brain · Wizardry · Zero to One
paolovella
212mo ago