150M AI-Generated Q&A Pages Static

Name: 150M AI-Generated Q&A Pages Static
Availability: InStock
Author: qeeebo

by qeeebo·Feb 20, 2026·4 points·1 comment

Visit Project View on HN

AI Analysis

●MidBold BetWizardry

150M static Q&A pages on CDN, but answers are AI-generated and unvetted.

Strengths

•Impressive infrastructure feat: 150M pre-rendered pages segmented and parallelized across Hugo builds.
•Thoughtful citation support (APA, MLA, Chicago, IEEE) and export formats (BibTeX, RIS, JSON-LD).
•Pure static CDN delivery eliminates server costs and scaling headaches at massive scale.

Weaknesses

•AI-generated content quality is opaque—no clear curation, fact-checking, or human review visible.
•Competes with Wikipedia, ChatGPT, and traditional search; unclear why static Q&A is better than dynamic alternatives for this use case.

Post Description

Over the past 6 months, our small team has been building Qeeebo — a large-scale question-and-answer knowledge archive designed to explore whether massive knowledge corpora can be published sustainably using fully static infrastructure.

This month, we are releasing:

• 150+ million structured questions • 24.5 million topics • 171 million topic-question relationships • 18+ million paginated topic pages • 100% pre-rendered static HTML • No origin servers — served entirely via CDN

Each question includes: – A full answer – A summary – Structured citation formats (APA, MLA, Chicago, IEEE, etc.) – Export formats (BibTeX, RIS, JSON-LD, YAML)

The entire system is generated in independent segments (~45k pages each), built across parallel machines running Hugo, then uploaded via automated multi-threaded pipelines with full failure tracking.

Why build this?

Large Q&A platforms historically struggled with sustainability — especially when operating on database-backed, dynamically rendered systems. We wanted to explore whether extreme-scale static generation could reduce infrastructure cost while increasing long-term durability.

This isn’t positioned as a replacement for Wikipedia or Stack Overflow. Instead, it’s an experiment in permanence and cost-efficient knowledge hosting at very large scale.

Happy to answer technical questions.

Similar Projects

Open Source●●●Banger

Self-hosted static archive of 20 years of Hacker News

Runs 22GB of HN history entirely in-browser using lazy-loaded SQLite shards over WebAssembly.

Rabbit HoleZero to OneWizardry

keepamovin

111mo ago

SaaS●Mid

What Is AI Citation Optimization?

Blog post positioning a SaaS tool, not a product or project worthy of Show HN.

arunkumars91

213mo ago

SaaS●●Solid

Rfp.ai – answer RFPs from approved docs, with source citations

Grounds every answer in approved docs with citations, unlike generic RFP writers.

Solve My ProblemSlick

dutchcode

301mo ago

AI/ML●●Solid

ProofPudding – Document Extraction API with Citations (PDF/Docx)

ProofPudding returns extraction results with explicit links back to the exact page and source text, supports native and scanned PDFs plus DOCX/images, and ships Python/TypeScript SDKs — handy for agents that need auditable facts. It’s a pragmatic product (per-extraction pricing and confidence scores are nice), but the market is crowded; I want clarity on underlying models, real-world accuracy numbers, and how it compares to Document AI/Textract in edge cases.

Solve My ProblemSlick

garai

104mo ago

AI/ML●●Solid

Built a public demo to explore SpaceX's IPO filing using multimodal RAG

Multimodal RAG on SpaceX S-1 with source trails, but document Q&A is a crowded category.

Ship ItNiche Gem

gabamnml

5110d ago

Developer Tools●●Solid

Ctxt.sh – Ask questions about any codebase, get answers with citations

Chat-with-codebase when Cursor, Sourcegraph, Continue already own this space.

Solve My ProblemShip It

Hudsonlatimer

213mo ago