SuperCSV – a typed, self-describing data format

Name: SuperCSV – a typed, self-describing data format
Availability: InStock
Author: mmm1

by mmm1·May 14, 2026·1 point·0 comments

Visit Project View on HN

AI Analysis

●MidBold Bet

Typed CSV format competing against Parquet, Avro, and JSON Schema.

Strengths

•Formal specification and public test suite included in v1.0 release.
•Explicit type declarations in headers eliminate schema inference guessing.

Weaknesses

•Solving a problem already addressed by Parquet, Avro, and Protocol Buffers.
•Adoption requires convincing ecosystem to support a new file standard.

Post Description

I've created a new open-source data format licensed under Apache 2.0.

SuperCSV is a CSV-like data format with explicitly defined types and self-describing files.

SuperCSV v1.0 includes a formal specification, a reference implementation in Go (validation, encoding, decoding), and a public test suite.

https://www.supercsv.com

https://github.com/supercsv/supercsv

Similar Projects

Other○Pass

Potatoverse, home for your vibecoded apps

Extremely minimal documentation; unclear what "vibecoded" apps are or how this differs from existing platforms.

born-jre

623mo ago

Infrastructure●●Solid

6cy v0.3.0 – A streaming-first binary archive format

Self-describing archive blocks with mandatory CRC32 and no fallback tricks.

Big BrainNiche GemWizardry

yihac1

104mo ago

Open Source●●●●Gem

Self-contained offline knowledge cards with ULID-DNA and IDsEd25519

DNA-encoded ULID makes knowledge cards globally unique, sortable, and decodable offline forever.

Zero to OneBig BrainNiche Gem

tomneijman

103mo ago

AI/ML●Mid

Data that explains itself to Coding Agents (Bonus: free, BYOA Lovable)

Ambitious self-describing data format, but 'free Lovable' claim oversells it.

Bold BetZero to One

TumbleCow

202mo ago

AI/ML●●Solid

Qwen Meetup, Function Calling Harness, turning 6.75% to 100%

Compiler-level validation turns Qwen's 6.75% structured output success rate into 100%.

Big BrainNiche Gem

samchon

102mo ago

Developer Tools●●Solid

Lodum, a Python Serializer/Deserializer (a.k.a. Load/Dump) Library

Impressive engineering choices — bytecode/AST generation for ~64% faster dumps and explicit Pyodide/WASM support show someone wrestled real performance and portability problems. It bundles one API across JSON, YAML, TOML, MsgPack/CBOR/BSON and adds native numpy/pandas handling plus basic validators and schema output. Still, it lives in a crowded Python serialization space (pickle, orjson, pydantic/serde alternatives), so adoption will hinge on ecosystem compatibility and convincing users to switch.

Niche GemWizardry

webmaven

204mo ago