Deep-XPIA – Prompt injection benchmark for multi-agent AI systems
Maps cross-agent injection attacks to real Copilot CVEs with live measurements.

Impressive case study numbers, but this is marketing content not a product launch.
Security teams and event organizers running live infrastructure
Cloudflare · AWS Shield · Fastly
Maps cross-agent injection attacks to real Copilot CVEs with live measurements.
First systematic attack framework proving 7/9 exploits work on AI agents with shell access.
Research-backed decision tree from a Master's thesis, but it's still an AI wrapper on existing frameworks.
Sub-second DDoS mitigation on your servers, but Cloudflare and AWS Shield dominate.
Reimplements dependency functions locally with test verification, challenging the "dependencies are good" mantra.
Agent red-teaming via UI, but attack catalog is shallow and comparison unclear vs. manual testing.