Back to browse
GitHub Repository

Agents that evolve their own skills. Self-healing multi-agent systems.

2 starsPython

EvoAgents – Agents that evolve their own skills

by jatingargiitk·Mar 2, 2026·1 point·0 comments

AI Analysis

●●●BangerBig BrainSolve My ProblemZero to One

Self-healing agents patch prompts automatically via replay validation; beats manual iteration.

Strengths
  • Replay gating ensures patches only ship after proving improvement on real traces
  • Section-level patching touches only broken constraints, avoiding collateral prompt damage
  • Structured SKILL.md format + versioning enable rollback and audit trail
Weaknesses
  • Depends on LLM evaluator quality; 'judge LLM' may miss real failures or create false positives
  • Early adoption risk: 1 star, 0 forks; no proven production usage yet
Target Audience

AI engineers building multi-agent pipelines, prompt engineers optimizing workflows

Similar To

Promptfoo · DSPy · LangSmith tracing

Post Description

I built an open-source framework where multi-agent systems fix their own prompts.

Each skill is a structured SKILL.md. After every run, an LLM judge scores each skill and tags exact failures. An LLM patcher generates candidate fixes to just the failing section. Each candidate is replayed on past traces. Winner gets promoted. Loser discarded.

One command: evoagents autofix

Key decisions: - LLM-as-judge, not regex — constraints are natural language, evaluation should be too - Section-level patching — only the broken part gets touched - Replay gating — no patch ships without proving it improves on real data - Versioned — every change is a new version, instant rollback

You can steer it: evoagents autofix --guide "prefer primary sources"

pip install evoagents https://github.com/jatingargiitk/evoagents

Similar Projects

Security●●●Banger

OpenClaw skills degrade agent safety

Behavioral safety testing reveals 45 regressions static analysis misses—guardrails provided.

Big BrainWizardryZero to One
shadab_nazar
123mo ago