Liteparse, a fast, universal, open-source document parser by the LlamaParse team
Local PDF parsing with spatial boxes that rivals LlamaParse without the cloud bill.
A fast, helpful, and open-source document parser
Beats PyPDF and MarkItDown on accuracy without needing GPUs or cloud APIs.
AI agent developers, RAG pipeline builders, document processing engineers
LlamaParse · PyMuPDF · MarkItDown
Because it does not require GPUs, Liteparse runs on any machine and can process a few hundred pages in seconds. It offers higher accuracy than similar tools such as PyPDF, PyMuPDF, and MarkItDown.
It supports a variety of file formats: PDFs, Office documents, and images. It can be installed with a single command as a skill for more than 40 AI agents, including Claude Code, Cursor, OpenClaw, Windsurf, and more.
Per-span confidence scores let you review uncertain OCR before trusting 200k-page runs.
Complete SQLite parser in C with AST generation for tooling and AI systems.
LlamaIndex open-sources their parser core, but LlamaParse cloud still handles complex layouts.
Rust rewrite with PDFium delivers 100x speedup over the Python v1.
First benchmark measuring semantic correctness over text similarity for document parsing.
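The review workflow suggested by per-span confidence scores and spatial boxes can be sketched roughly as follows. This is a minimal illustration, not Liteparse's actual API: the `Span` class, its field names, the 0.0 to 1.0 confidence range, the 0.85 threshold, and the sample data are all assumptions made up for this example.

```python
from dataclasses import dataclass


@dataclass
class Span:
    """Hypothetical parsed text span; NOT Liteparse's real output schema."""
    page: int
    text: str
    bbox: tuple[float, float, float, float]  # assumed (x0, y0, x1, y1) page coordinates
    confidence: float  # assumed to range from 0.0 to 1.0


def split_by_confidence(spans, threshold=0.85):
    """Split spans into (trusted, needs_review) at an assumed threshold."""
    trusted = [s for s in spans if s.confidence >= threshold]
    needs_review = [s for s in spans if s.confidence < threshold]
    return trusted, needs_review


# Made-up sample spans standing in for parser output.
spans = [
    Span(1, "Invoice #1042", (72.0, 90.0, 210.0, 104.0), 0.99),
    Span(1, "T0tal: $l,250", (72.0, 400.0, 190.0, 414.0), 0.61),
    Span(2, "Payment due in 30 days", (72.0, 90.0, 260.0, 104.0), 0.93),
]

trusted, needs_review = split_by_confidence(spans)

# Print each uncertain span with its location so a reviewer can find it on the page.
for s in needs_review:
    print(f"page {s.page} at {s.bbox}: {s.text!r} (confidence {s.confidence:.2f})")
```

In a large batch run, only the `needs_review` spans would go to a human or to a slower fallback parser, while the rest pass straight through. The right threshold depends on the documents and would need tuning against a labeled sample.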