A self upgrading agent that learns from failure
Agent writes its own Python tools and saves rules to avoid repeating mistakes.
The self-improving sandboxed and open-source AI agent. With persistent memory and scheduling.
Writes errors to adaptation.json so it never makes the same mistake twice.
AI developers, automation researchers, agent framework builders
Devin · OpenHands · AutoGen
Agent writes its own Python tools and saves rules to avoid repeating mistakes.
Sandboxed agent that writes its own Python tools and remembers mistakes in JSON.
Agents cheated benchmarks by hardcoding task info into the harness configuration.
Multi-turn adaptive testing finds agent failures static benchmarks miss, but eval space is crowded.
Meta-prompt telling agents to build better agents — no actual code or working system.
Another open-source AI agent—self-improvement is just keyword-matched JSON rules.