GitHub Repository

Agent-native inference engine with O(1) fork latency for tree-structured reasoning

3 starsRust

Dendrite – O(1) KV cache forking for tree-structured LLM inference

Name: Dendrite – O(1) KV cache forking for tree-structured LLM inference
Availability: InStock
Author: RyeCatcher

by RyeCatcher·Mar 30, 2026·3 points·1 comment

Visit Project View on HN

AI Analysis

●●●BangerWizardryBig BrainZero to One

O(1) fork latency makes tree search 1000x faster than vLLM for agentic workloads.

Strengths

•Copy-on-write block tables enable constant-time branching without KV cache duplication
•Built-in MCTS and beam search algorithms with UCT scoring out of the box
•Memory efficiency: 1.1GB vs 6GB for 6-branch exploration with 4K prefix

Weaknesses

•Zero stars and 5 open issues signals very early stage, unproven in production
•Only useful for tree-structured inference, not single-sequence chat workloads

Similar Projects

AI/ML●●●Banger

Thaw – Git branch for a running LLM (fork agents, skip prefill)

Git branch for LLM agents — 400x faster forking with preserved KV cache.

WizardryBig BrainSolve My Problem

nilsmatteson

3020d ago

Infrastructure●●●Banger

Ranvier – Prefix-aware routing for LLM inference

Routes LLM requests to GPUs with cached KV prefixes, skipping redundant prefill computation.

WizardryBig Brain

mindsaspire

103mo ago

Developer Tools●●Solid

Composable middleware for LLM inference Optimization Passes

Tower-style middleware stacking for inference guardrails beats bolted-on if-statements.

Big BrainNiche GemShip It

human_hack3r

703mo ago

Developer Tools●●Solid

ACDC – A non-agentic AI coding tool with L0-L3 context cache tiering

Multi-tier caching + tree-sitter indexing, but lacks agent autonomy competitors ship today.

Big BrainNiche Gem

flatmax

124mo ago

Developer Tools●●●Banger

oMLX – Native Mac inference server that persists KV cache to SSD

SSD-cached KV blocks dodge re-prefill tax on context shifts—Claude Code now viable locally.

Solve My ProblemWizardryShip It

jundot

104mo ago

AI/ML●●Solid

Gcontext – a tree of llms.txt files to steer agents on support tasks

llms.txt tree structure lets agents navigate context instead of dumping everything.

Big BrainNiche Gem

bsampera

101d ago