Back to browse
GitHub Repository

Agent Skills Evaluation Framework

52 starsPython

Skill Lab – CLI tool for testing and optimizing agent skills

by qu4rk5314·Mar 22, 2026·1 point·0 comments

AI Analysis

●●SolidNiche GemShip It

Security scanning catches data exfiltration before skills go live.

Strengths
  • 33 checks across 5 dimensions with 0-100 scoring per skill
  • Token cost estimates for discovery vs activation phases
  • Auto-generates ~13 trigger tests to validate skill firing
Weaknesses
  • Agent skill evaluation space getting crowded with similar tools
  • No clear differentiation from LangChain eval or AgentOps
Category
Target Audience

AI agent developers and teams building skill-based agent systems

Similar To

LangChain eval tools · AgentOps · Arize Phoenix

Similar Projects

Security●●●Banger

A security scanner for AI Agent Skills

Docker sandbox execution catches runtime threats static analysis alone misses.

Big BrainBold Bet
mayziem
502mo ago
AI/MLMid

We Evaluates Medical Research Agent Skills

Curated prompt library with 420+ skills, but agent skill marketplaces already exist.

Niche Gem
The_resa
202mo ago