Show HN: Parqeye – A CLI tool to visualize and inspect Parquet files (github.com)
I built a Rust-based CLI/terminal UI for inspecting Parquet files (data, metadata, and row-group-level structure) right from the terminal. If someone sent me a Parquet file, I used to open DuckDB or Polars just to see what was inside. Now I can do it with one command.
Nice work, this hits a real pain point with Parquet. My main use case is debugging partitioned datasets on S3 with schema drift and skew, where I care about which files/partitions have schema mismatches or weird row-group stats (all-null, out-of-range, huge skew), and I want to find that out from metadata only.
Right now parqeye looks mainly single-file focused. Do you have plans for a “dataset mode” that takes a dir/S3 prefix and surfaces per-file/row-group summaries (row counts, min/max, null %, schema diffs vs a reference file) using just Parquet stats so it scales to tens of GB? Or do you see parqeye intentionally staying a single-file inspector?
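For anyone wondering why a metadata-only pass can scale: a Parquet file stores its schema and row-group statistics in a footer at the end of the file, and the final 8 bytes are a 4-byte little-endian footer length followed by the magic `PAR1`. Here is a minimal Python sketch of that idea, assuming local files with plaintext footers. It is not how parqeye works internally; `footer_length` and `scan_directory` are hypothetical names, and decoding the Thrift-encoded footer itself is left out.

```python
import os
import struct

MAGIC = b"PAR1"  # files with a plaintext footer start and end with these 4 bytes


def footer_length(path):
    """Return the size in bytes of the footer metadata (hypothetical helper).

    Only the last 8 bytes of the file are read: a 4-byte little-endian
    footer length followed by the 4-byte magic.
    """
    size = os.path.getsize(path)
    if size < 12:  # leading magic + footer length + trailing magic
        raise ValueError(f"{path}: too small to be a Parquet file")
    with open(path, "rb") as f:
        f.seek(-8, os.SEEK_END)
        tail = f.read(8)
    length, magic = struct.unpack("<I4s", tail)
    if magic != MAGIC:
        raise ValueError(f"{path}: trailing PAR1 magic not found")
    if length > size - 12:
        raise ValueError(f"{path}: footer length {length} exceeds file size")
    return length


def scan_directory(root):
    """Yield (path, file_size, footer_bytes) for each *.parquet file under root."""
    for dirpath, _dirs, files in os.walk(root):
        for name in sorted(files):
            if name.endswith(".parquet"):
                path = os.path.join(dirpath, name)
                yield path, os.path.getsize(path), footer_length(path)
```

Comparing `footer_bytes` to `file_size` for each file shows how little has to be fetched before any column data is touched. On S3 the same read would be a ranged GET on the tail of the object, which is an assumption about how one would wire it up rather than something this sketch does.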
Looks great!
Another seemingly extremely similar project released in the last few days: https://github.com/raulcd/datanomy
It’s crazy how long we’ve gone without a tool like this. This is huge. Thank you for finally building this!
It is really incredible how poor the Parquet tooling has been for years. It is a cornerstone of data engineering, yet just inspecting a file is needlessly clunky.
Can DuckDB be included in the tool, so you can run queries directly from the UI? [that would avoid opening DBeaver whenever you need that kind of feature]
Hu huuum... https://harlequin.sh/
This tool actually feels pretty solid too.
This is very impressive. Looking forward to using this.
tried it out. love it.
Such a cool idea!! So helpful
Looks like a nice tool, but it failed for me when reading a GeoParquet file created using DuckDB.
Apart from some visual glitches, this is an INSTANT BUY!
Note: must the Windows binary really be 78 MB?
CLIs are bulky
thank you so much! this was an annoyance of mine for so long. edit: any chance you make a brew package? if you'd like I'd be happy to PR it in.
yep! it’s available as a homebrew tap — you can install it with: `brew install kaushiksrini/parqeye/parqeye`
awesome! i was just looking at a bucket full of parquet files from last year trying to recall some things about them.
i tried to install with brew, but it told me my cli tools were "too out of date". Never seen that before! and also just upgraded.
Will try again tomorrow
wondrous.