Back to browse
SuperCSV – a typed, self-describing data format

SuperCSV – a typed, self-describing data format

by mmm1·May 14, 2026·1 point·0 comments

AI Analysis

MidBold Bet

Typed CSV format competing against Parquet, Avro, and JSON Schema.

Strengths
  • Formal specification and public test suite included in v1.0 release.
  • Explicit type declarations in headers eliminate schema inference guessing.
Weaknesses
  • Solving a problem already addressed by Parquet, Avro, and Protocol Buffers.
  • Adoption requires convincing ecosystem to support a new file standard.
Category
Target Audience

Data engineers and backend developers dealing with CSV parsing issues

Similar To

Parquet · Avro · JSON Schema

Post Description

I've created a new open-source data format licensed under Apache 2.0.

SuperCSV is a CSV-like data format with explicitly defined types and self-describing files.

SuperCSV v1.0 includes a formal specification, a reference implementation in Go (validation, encoding, decoding), and a public test suite.

https://www.supercsv.com

https://github.com/supercsv/supercsv

Similar Projects

Developer Tools●●Solid

Lodum, a Python Serializer/Deserializer (a.k.a. Load/Dump) Library

Impressive engineering choices — bytecode/AST generation for ~64% faster dumps and explicit Pyodide/WASM support show someone wrestled real performance and portability problems. It bundles one API across JSON, YAML, TOML, MsgPack/CBOR/BSON and adds native numpy/pandas handling plus basic validators and schema output. Still, it lives in a crowded Python serialization space (pickle, orjson, pydantic/serde alternatives), so adoption will hinge on ecosystem compatibility and convincing users to switch.

Niche GemWizardry
webmaven
204mo ago