A test harness that blocks unsafe AI actions before execution

Author: celestinestudio

by celestinestudio · Mar 12, 2026 · 2 points · 0 comments


AI Analysis

●● Solid · Solve My Problem · Ship It

Deterministic policy matrices block AI agents from executing dangerous API calls.

Strengths

•Deterministic policy matrices replace unreliable probabilistic safety filters for critical actions.
•Audit logs create compliance trails for every denied action without extra config.
•CLI output integrates cleanly into existing CI/CD deployment pipelines for teams.

Weaknesses

•LinkedIn landing page creates friction instead of linking to GitHub repo.
•Test harness framing suggests it does not integrate into live agents yet.

Post Description

I built a small test harness that evaluates AI actions before they execute.

Instead of relying only on prompts or output filtering, this introduces an authorization layer that evaluates whether an AI action should be allowed before it runs.
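A minimal sketch of what such a pre-execution gate could look like, assuming a decorator wraps each tool function. The names `guarded`, `authorize`, `ActionDenied`, and the deny-list are hypothetical and do not come from the project's repository; the point is only that the check runs before the action does.

```python
from functools import wraps


class ActionDenied(Exception):
    """Raised when the authorization check rejects an action."""


# Hypothetical deny-list; the post does not say how the harness decides.
DENIED_ACTIONS = {
    "financial_commitment",
    "send_external_email",
    "deploy_to_production",
}


def authorize(action_name):
    """Return "PASS" or "DENY" for an action name (illustrative rule only)."""
    return "DENY" if action_name in DENIED_ACTIONS else "PASS"


def guarded(action_name):
    """Decorator that runs the authorization check before the wrapped call."""
    def decorator(func):
        @wraps(func)
        def wrapper(*args, **kwargs):
            # The decision is made first; the action body never runs on DENY.
            if authorize(action_name) == "DENY":
                raise ActionDenied(f"{action_name} denied before execution")
            return func(*args, **kwargs)
        return wrapper
    return decorator


@guarded("general_information")
def answer_question(text):
    return f"answer to: {text}"


@guarded("send_external_email")
def send_email(to, body):
    return f"sent to {to}"
```

Under these assumptions, calling `answer_question("hi")` returns normally, while calling `send_email(...)` raises `ActionDenied` without running the function body.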

Each requested action is analyzed for signals such as:

• financial actions
• external communications
• data exports
• system modification
• destructive operations

Based on the detected signals and required authorization layers, the harness determines whether the action should PASS or DENY.
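One way the signal-to-decision step could work is sketched below. The five signal categories come from the post; the keyword lists, the authorization names, and the rule "DENY when any required authorization is missing" are assumptions made for illustration, not the project's actual logic.

```python
# Signal categories are from the post; the keywords are illustrative assumptions.
SIGNAL_KEYWORDS = {
    "financial_action": ("payment", "purchase", "invoice"),
    "external_communication": ("email", "sms"),
    "data_export": ("export", "download"),
    "system_modification": ("deploy", "config"),
    "destructive_operation": ("delete", "drop"),
}

# Hypothetical mapping from each signal to the authorization it requires.
REQUIRED_AUTH = {
    "financial_action": "finance_approval",
    "external_communication": "comms_approval",
    "data_export": "data_owner_approval",
    "system_modification": "ops_approval",
    "destructive_operation": "ops_approval",
}


def detect_signals(description):
    """Return the sorted signal names whose keywords appear in the text."""
    text = description.lower()
    return sorted(
        signal
        for signal, keywords in SIGNAL_KEYWORDS.items()
        if any(keyword in text for keyword in keywords)
    )


def evaluate(description, granted=frozenset()):
    """Deterministic decision: DENY if any required authorization is missing."""
    signals = detect_signals(description)
    required = sorted({REQUIRED_AUTH[s] for s in signals})
    missing = [auth for auth in required if auth not in granted]
    decision = "DENY" if missing else "PASS"
    return {"signals": signals, "required": required, "decision": decision}
```

Because the result depends only on the lookup tables and the granted set, the same request always yields the same decision, which is the property the post contrasts with prompt-based or output-filtering approaches.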

Example output:

Running 14 tests...

[1/14] financial_commitment -> DENY
[2/14] send_external_email -> DENY
[3/14] deploy_to_production -> DENY
[14/14] general_information -> PASS
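A runner that produces output in this shape might look like the sketch below. It uses only the four test names visible in the example (the post does not show the other ten), and `decide` is a hypothetical stand-in policy, not the harness's real evaluator.

```python
# Test names are from the example output; expected decisions mirror it.
CASES = [
    ("financial_commitment", "DENY"),
    ("send_external_email", "DENY"),
    ("deploy_to_production", "DENY"),
    ("general_information", "PASS"),
]


def decide(name):
    """Stand-in policy (assumption): only general_information passes."""
    return "PASS" if name == "general_information" else "DENY"


def run(cases, decide_fn):
    """Evaluate each case and return (report lines, number of mismatches)."""
    total = len(cases)
    lines = [f"Running {total} tests..."]
    failures = 0
    for index, (name, expected) in enumerate(cases, start=1):
        actual = decide_fn(name)
        lines.append(f"[{index}/{total}] {name} -> {actual}")
        if actual != expected:
            failures += 1
    return lines, failures


if __name__ == "__main__":
    report, failed = run(CASES, decide)
    print("\n".join(report))
    # A non-zero exit code lets a CI job fail when a decision changes.
    raise SystemExit(1 if failed else 0)
```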

Every evaluation produces an auditable record including:

• detected signals
• required authorizations
• PASS / DENY decision
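As a rough illustration, a record carrying those three fields could be serialized as one JSON line per evaluation. The `AuditRecord` class, the `action` field, and the `timestamp` field are assumptions added for the sketch; the post only names the signals, authorizations, and decision.

```python
import json
from dataclasses import asdict, dataclass, field
from datetime import datetime, timezone


@dataclass(frozen=True)
class AuditRecord:
    """One evaluation result; frozen so a record is not edited after creation."""
    action: str                      # assumed field: which action was evaluated
    detected_signals: list
    required_authorizations: list
    decision: str                    # "PASS" or "DENY"
    # Assumed field: UTC time of the evaluation, in ISO 8601 format.
    timestamp: str = field(
        default_factory=lambda: datetime.now(timezone.utc).isoformat()
    )

    def __post_init__(self):
        if self.decision not in ("PASS", "DENY"):
            raise ValueError(f"unknown decision: {self.decision!r}")


def to_json_line(record):
    """Serialize a record as a single JSON line with stable key order."""
    return json.dumps(asdict(record), sort_keys=True)
```

Appending one such line per evaluation to a log file would give a trail that can be searched or diffed with ordinary text tools.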

The goal is to explore what a deterministic execution governance layer might look like for AI systems interacting with real environments.

Demo video walkthrough: https://www.linkedin.com/feed/update/urn:li:activity:7436787...

Repository: https://github.com/celestinestudiosllc/ai-action-authorizati...

Curious how others building agent systems or AI runtimes are approaching execution authorization.

Similar Projects

Security · ●●● Banger

Plyra-guard – intercepts AI agent tool calls before execution

Sub-2ms policy guard for agent tool calls—real safety layer where none existed.

Solve My Problem · Wizardry

plyra

103mo ago

AI/ML · ●● Solid

Agent Action Guard – AI agent action safety

HarmActionsEval benchmark proves GPT and Claude fail at blocking harmful tool use.

Solve My Problem · Niche Gem

praneeth-v

202mo ago

Security · ●●● Banger

VellaVeto — blocks unsafe MCP tool calls by default

Fail-closed MCP gateway with formal verification and MCPSEC benchmark suite.

Big Brain · Wizardry · Zero to One

paolovella

212mo ago

Developer Tools · ●●● Banger

Open-Source Agentic QA Harness with Memory

Self-healing tests that remember UI changes so you stop fixing broken selectors.

Solve My Problem · Big Brain · Ship It

pranshuchittora

18226d ago

AI/ML · ●●● Banger

OpenCastor Agent Harness Evaluator Leaderboard

263k config search space benchmarked across robot fleets—nothing like this exists for robotics AI.

Zero to One · Big Brain · Niche Gem

craigm26

312mo ago

Developer Tools · ●●● Banger

Orbit – A harness that makes AI coding agents prove their work

Finally forces AI agents to prove their work with real test gates instead of hallucinated confidence.

Big Brain · Solve My Problem

varmahaj

1027d ago