XiangpengHao / parquet-viewerLinks
View parquet files online
☆200Updated 3 weeks ago
Alternatives and similar repositories for parquet-viewer
Users that are interested in parquet-viewer are comparing it to the libraries listed below
Sorting:
- [SIGMOD 2026] F3: The Open-Source Data File Format for the Future☆287Updated last month
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆297Updated this week
- Distributed pushdown cache for DataFusion☆348Updated this week
- TPC-H benchmark data generation in pure Rust☆215Updated last week
- Apache DataFusion Ray☆224Updated 2 months ago
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆155Updated last week
- Pure Rust Iceberg Implementation☆161Updated last year
- DataFusion TableProviders for reading data from other systems☆160Updated this week
- Distributed SQL Query Engine in Python using Ray☆246Updated last year
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆228Updated last week
- A User-Defined Function Framework for Apache Arrow.☆108Updated 2 months ago
- Message queue and data streaming based on cloud native services.☆115Updated last week
- ☆53Updated 5 months ago
- Apache Paimon Rust The rust implementation of Apache Paimon.☆134Updated 7 months ago
- Batteries included CLI, TUI, and server implementations for DataFusion.☆182Updated 3 weeks ago
- Boring Data Tool☆237Updated last year
- CMU-DB's Cascades optimizer framework☆404Updated 11 months ago
- A native Delta implementation for integration with any query engine☆298Updated this week
- Learn Data Lake From Storage Layer.☆45Updated last year
- Compaction runtime for Apache Iceberg.☆111Updated last week
- (Experimental) Template for Rust-based DuckDB extensions☆87Updated last week
- Rust implementation of the FastLanes compression library☆149Updated this week
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆262Updated this week
- Apache Parquet Testing☆77Updated 2 weeks ago
- Rust based high-performance Apache Uniffle shuffle-server☆45Updated this week
- Embeddable Aggregate Management System for Streams and Queries.☆103Updated last month
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆270Updated 8 months ago
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆143Updated 2 months ago
- Apache DataFusion Benchmarks☆22Updated 2 weeks ago
- Next-Gen Big Data File Format☆537Updated 2 months ago