Tessera – MCP server that gives Claude persistent memory and local RAG
Local RAG + MCP for Claude with zero external dependencies—elegant constraint execution.
Valkey vector search for Claude Code memory—clever glue, narrow audience.
Claude Code users working on complex, multi-session projects who need persistent architectural context.
Continue.dev · Cursor AI · Aider
https://github.com/BetterDB-inc/memory
The problem: Claude Code has CLAUDE.md and auto memory, but it's flat text with no semantic retrieval. You end up repeating context, especially around things not to do.
BetterDB Memory hooks into Claude Code's lifecycle (SessionStart, PostToolUse, PreToolUse, Stop), summarizes each session, generates embeddings, and stores everything in Valkey using FT.SEARCH with HNSW. Next session, relevant memories surface automatically via vector similarity search.
The interesting technical bit is that Valkey handles all of it - vector search, hash storage for structured memory data, sorted sets for knowledge indexing, lists for compression queues. No separate vector database.
There's also an aging pipeline that applies exponential decay to old memories based on recency, clusters similar ones via cosine similarity, and merges them to keep the memory store from growing unbounded.
Self-hostable with Ollama for embeddings and summarization, or plug in any LLM provider. Runs on Bun, ships as compiled binaries. MIT licensed.
Local RAG + MCP for Claude with zero external dependencies—elegant constraint execution.
MCP server lets Claude search its own past sessions locally without cloud dependency.
Claude Code memory without Redis—138 lines of diskcache beats 'use a vector DB' conventional wisdom.
Persistent memory for Claude Code with auto-compaction recovery—but relies entirely on external API.
Persistent memory layer stores decisions across sessions when other tools forget everything.
Claude Code memory server that auto-reads OAT tokens, routes LLM ops to local or cloud models.