Slopsome – a VRAM fit calculator and tok/s database for local LLMs
VRAM calculator with crowd-sourced tok/s benchmarks when model cards already exist.

Opinionated llama.cpp VRAM calculator that outputs ready-to-run server commands.
Local LLM hobbyists, developers running inference on consumer hardware
Hugging Face VRAM Calculators · Llama.cpp Documentation · Text Generation WebUI
VRAM calculator with crowd-sourced tok/s benchmarks when model cards already exist.
450k context on 32GB VRAM using turboquant KV cache compression.
Multi-item container fit calculator that actually saves state locally.
2x prefill speedup on 12k+ token contexts by treating GPUs like a production line.
Panama FFM beats JNI for in-process llama.cpp - no sidecar, no HTTP, no native install.
Finally one CLI for Ollama, llama.cpp, and vLLM instead of three separate tools.