csvkit.org
CSV (Comma-Separated Values) utilities, in the browser
Say hi →

Sample datasets

a few realistic CSVs for testing the tools on this site · updated 4 May 2026

Every tool here works just as well with your own CSV, but sometimes you just want something to drop in and see what the output looks like. The files below are small, realistic, and cover the obvious edge cases — including an intentionally broken one for testing the validators. Each link saves the file to your machine; nothing is uploaded anywhere when you then drop it into a tool.

Browse the full collection on GitHub → The cards below are a curated subset of tinytoolkit-org/csv-datasets. More files (Titanic, wine quality, OHLCV stock data, NYC taxi trips, 50k server metrics) plus reproducible Python generators. CC0 1.0.

Small starter files

Larger datasets

Bigger files for benchmarking, testing memory usage, and seeing how the tools handle thousands or millions of rows. They're synthetic but schema-realistic and use a stable seed, so the same file is reproducible.

How to use these

Click any card to download. Then drop or paste the file into whichever tool fits — CSV to JSON for the clean ones, the viewer for a quick look, or any of the others. The intentionally-broken file (broken-csv.csv) is useful for trying parsers and seeing how they recover.

Want one added?

If you have a real-world file you'd love to see as a sample (anonymised, of course), email [email protected] and I'll add it. The smaller and weirder the better — the goal is to have files that exercise the tools, not pretty demos.

— S., [email protected]