Make a free 3.8B model as reliable as one 7× bigger at parsing data

by pcoz · Jun 1, 2026 · 4 points · 1 comment

AI Analysis

●● Solid · Big Brain · Dark Horse

A deterministic verification loop is claimed to make a 3.8B model match one 7× larger at structured extraction.

Strengths

• Combining a regime gate, exact graph analysis, and explicit refusal is a genuinely novel architecture.
• It has zero runtime dependencies and can run with no model at all, which is impressive flexibility.
• A bounded re-extraction loop fills gaps by re-asking the model and pointing out the missing fields.
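
The bounded re-extraction loop mentioned above can be illustrated with a minimal sketch. This is not the project's actual API: `REQUIRED_FIELDS`, `missing_fields`, `extract_with_feedback`, and the `ask_model` callback are all hypothetical names, and the sketch assumes the model call returns a plain dict of field values.

```python
from typing import Callable, Optional

# Hypothetical schema; the real project may define required fields differently.
REQUIRED_FIELDS = ("name", "date", "amount")


def missing_fields(record: dict, required=REQUIRED_FIELDS) -> list:
    """Deterministic check: list required fields that are absent or empty."""
    return [f for f in required if record.get(f) in (None, "")]


def extract_with_feedback(
    text: str,
    ask_model: Callable[[str, list], dict],  # assumed signature: (text, fields) -> dict
    required=REQUIRED_FIELDS,
    max_rounds: int = 3,
) -> Optional[dict]:
    """Re-ask the model only for missing fields, at most max_rounds times."""
    record: dict = {}
    for _ in range(max_rounds):
        gaps = missing_fields(record, required)
        if not gaps:
            break  # every required field is filled; stop early
        update = ask_model(text, gaps)
        for key, value in update.items():
            # Accept only fields that were asked for and are non-empty.
            if key in gaps and value not in (None, ""):
                record[key] = value
    # Explicit refusal: return None rather than a partially filled record.
    return record if not missing_fields(record, required) else None
```

Because the check is plain code rather than another model call, the loop always terminates after `max_rounds` attempts and either returns a complete record or refuses.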

Weaknesses

• The README shows no benchmarks to verify the claim that a 3.8B model matches one 7× larger.
• Instructor, Pydantic, and guidance already handle structured LLM output.

Category

Target Audience

Developers using local LLMs for structured data extraction

Similar To

Instructor · Pydantic · guidance

Post Description

https://github.com/pcoz/llm-feedback-control

Similar Projects

Developer Tools · ●●● Banger

AgentCost – Track, control, and optimize your AI spending (MIT)

One-line wrapping eliminates invisible LLM spend; real cost forecasting and model recommendations.

Solve My Problem · Slick

agentcostin

313mo ago

AI/ML · ●●● Banger

Datetime-bench: which datetime formats LLMs get right (and wrong)

RFC 3339 hits 88% accuracy while unix epoch fails 50% of the time.

Solve My Problem · Dark Horse

diwank

212mo ago

Developer Tools · ●● Solid

A deterministic middleware to compress LLM prompts by 50-80%

Deterministic prompt compression cuts tokens 50-80% without extra model calls.

Big Brain · Niche Gem

rosspeili

303mo ago

Developer Tools · ●● Solid

SafeRun – Replay debugging and inline prevention for AI agents

Replay-first architecture beats LangSmith's static traces for debugging non-deterministic agents.

Ship It · Solve My Problem

Tidianez

111mo ago

Developer Tools · ●● Solid

Pviz-parser – codebase parsing package for Python and TS/JS codebases

Compressed JSON bundles fit tight context windows better than pasting files.

Big Brain · Niche Gem

pvizgenerator

1024d ago

AI/ML · ● Mid

Reliably Incorrect – explore LLM reliability with data visualizations

Clever Rubik's cube demo but it's educational content, not a reusable tool.

Eye Candy · Niche Gem

dataviz1000

232mo ago