Back to browse
GitHub Repository

Utility package for comparing polars data frames.

42 starsPython

Diffly – A Python package to compare polars dataframes

by ekarais·Mar 30, 2026·3 points·0 comments

AI Analysis

●●SolidSolve My ProblemSlick

DataFrame diffing for Polars with ASCII output when pandas comparisons fall short.

Strengths
  • Primary key-based comparison detects missing rows, schema changes, and value diffs.
  • ASCII summary output shows percentages and sample keys for each change type.
Weaknesses
  • Polars-only, no pandas or DuckDB support for broader data team adoption.
  • Dataframe comparison tools exist (great_expectations, pandas-profiling).
Target Audience

Data engineers working with Polars DataFrames

Similar To

great_expectations · pandas-profiling · data-diff

Post Description

We built this after having to manually compare new pipeline outputs with older ones too many times to find out why they are different. It is a simple package that, given two polars dataframes and a primary key, produces a nicely formatted summary of differences. It also has some methods to investigate the differences more deeply.

Similar Projects

DataMid

DataFrame Library Nobody Asked For

Yet another DataFrame library competing against established tools like Polars.

Bold Bet
NavodPeiris
2016d ago
AI/ML●●Solid

Valohai LLM – Track and compare LLM evaluation results in one dashboard

Streams evals from a tiny Python client into a shared dashboard and lets you run parameter sweeps and compare up to six configurations with radar/bar charts and scorecards — exactly the sort of tooling that stops results getting lost in notebooks. Useful, pragmatic product for teams who repeatedly evaluate models, but it's competing with general observability/experiment trackers (W&B, Neptune) and will need strong integrations and metric flexibility to stand out.

Niche GemSolve My Problem
radicain
303mo ago