Data●●Solid
10k harmonized time-series datasets of African data
7,900+ harmonized African datasets with BibTeX provenance and one-line dataset library loading.
Niche GemBig Brain
kossisoroyce
204d ago

Cross-platform dataset search with health scores when Kaggle and HF are fragmented.
Machine learning engineers, Data scientists
Google Dataset Search · Hugging Face Datasets · Kaggle
7,900+ harmonized African datasets with BibTeX provenance and one-line dataset library loading.
518k Vietnamese legal documents fill a massive gap in Southeast Asian NLP datasets.
Clean Parquet dump of 55M Open Library rows saves weeks of data cleaning.
Paste any HF URL to instantly see the full transformer architecture graph.
Useful calibration dataset, but it's just logged outputs without analysis tools.
Small 7k row dataset for media bias research when larger corpora already exist.