apetrov 5 days ago

Not sure why it should be specific to Parquet files. A thin UI wrapper around DuckDB could do the trick, and for the majority of formats (Parquet, CSV, JSON, SQLite, Iceberg, Delta Lake).
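
A minimal sketch of how such a wrapper might pick a DuckDB reader per format. `READERS` and `preview_query` are hypothetical names; `read_parquet`, `read_csv` and `read_json` are DuckDB table functions. SQLite, Iceberg and Delta Lake are left out because they go through DuckDB extensions with their own scan functions. The sketch only builds the SQL text, so it runs without DuckDB installed:

```python
from pathlib import Path

# Hypothetical mapping from file suffix to a DuckDB table function.
READERS = {
    ".parquet": "read_parquet",
    ".pq": "read_parquet",
    ".csv": "read_csv",
    ".json": "read_json",
}


def preview_query(path: str, limit: int = 20) -> str:
    """Build a SELECT that previews the first `limit` rows of `path`."""
    suffix = Path(path).suffix.lower()
    if suffix not in READERS:
        raise ValueError(f"unsupported format: {suffix or path}")
    quoted = path.replace("'", "''")  # escape single quotes for a SQL string literal
    return f"SELECT * FROM {READERS[suffix]}('{quoted}') LIMIT {int(limit)}"
```

With the `duckdb` Python package installed, something like `duckdb.sql(preview_query("events.parquet")).show()` should render the preview.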

  • sanspareilsmyn 5 days ago

    Thanks for your idea. I first thought that simply viewing data might not be useful, as many IDEs already handle basic data previews. One of the core goals was to access stored metadata directly from the file without necessarily scanning the data. Your idea of potentially mixing the current implementation (using pyarrow) with DuckDB is very interesting :)
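
    To illustrate why metadata can be read without scanning the data: an unencrypted Parquet file ends with the Thrift-encoded footer, a 4-byte little-endian footer length, and the magic bytes `PAR1`. The stdlib-only sketch below locates the footer by reading just the last 8 bytes. `footer_location` is a hypothetical helper, not part of the tool discussed here, and it does not decode the footer itself:

```python
import struct

MAGIC = b"PAR1"  # unencrypted Parquet files start and end with this marker


def footer_location(path: str) -> tuple[int, int]:
    """Return (offset, length) of the Thrift-encoded footer metadata.

    Only the last 8 bytes of the file are read; row data is never touched.
    """
    with open(path, "rb") as f:
        f.seek(0, 2)  # jump to the end of the file
        size = f.tell()
        if size < 12:  # leading magic + footer length + trailing magic
            raise ValueError("file too small to be Parquet")
        f.seek(size - 8)
        tail = f.read(8)
    length = struct.unpack("<I", tail[:4])[0]  # little-endian footer length
    if tail[4:] != MAGIC:
        raise ValueError("missing PAR1 trailer")
    return size - 8 - length, length
```

    In practice, `pyarrow.parquet.read_metadata(path)` returns the decoded footer (schema, row groups, row counts), so a helper like this is only useful for seeing how little of the file has to be touched.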

    • apetrov 5 days ago

      Note: with DuckDB you get network storage for free (i.e. a Delta table on S3 works the same as a local one) and, I guess, a smaller dependency footprint than pyarrow (might be wrong).

  • dammaj 5 days ago

    I upvote this suggestion and would add pickle and HDF5 to the list.