Back to browse
Remoroo. trying to fix memory in long-running coding agents

Remoroo. trying to fix memory in long-running coding agents

by adhamghazali·Apr 14, 2026·3 points·0 comments

AI Analysis

●●●BangerBig BrainWizardry

Demand-paging memory for agents beats context window limits that break Cursor and Devin.

Strengths
  • OS-inspired virtual memory system prevents context slipping in multi-hour agent runs
  • Metric-based keep/discard logic with git-backed experiment artifacts and replay
  • Local overnight execution with Haiku-hour billing beats per-token agent pricing
Weaknesses
  • Memory system internals underspecified in public docs compared to the blog writeup
  • Autonomous code editing space already crowded with Cursor, Devin, and OpenDevin
Category
Target Audience

ML engineers and deep tech teams running iterative experiments

Similar To

Cursor · Devin · OpenDevin

Post Description

I built Remoroo because most coding agents fall apart once the work stops being a short edit-and-run loop.

A real engineering experiment can run for hours. Along the way, the agent reads files, runs commands, checks logs, compares metrics, tries ideas that fail, and needs to remember what already happened. Once context starts slipping, it forgets the goal, loses track of the baseline, and retries bad ideas.

Remoroo is my attempt to solve that problem.

You point it at a repo and give it a measurable goal. It runs locally, tries changes, executes experiments, measures the result, keeps what helps, and throws away what does not.

A big part of the system is memory. Long runs generate far more context than a model can hold, so I built a demand-paging memory system inspired by OS virtual memory to keep the run coherent over time.

There is a technical writeup here: https://www.remoroo.com/blog/how-remoroo-works

Would love feedback from people working on long-running agents, training loops, eval harnesses, or similar workflows.

Similar Projects