apache / parquet-testing
Apache Parquet Testing
☆73 Updated 2 months ago
Alternatives and similar repositories for parquet-testing
Users who are interested in parquet-testing are comparing it to the libraries listed below:
- TPC-H benchmark data generation in pure Rust ☆197 Updated last month
- Compaction runtime for Apache Iceberg. ☆95 Updated this week
- ☆50 Updated 3 months ago
- Apache DataFusion Ray ☆222 Updated 2 weeks ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper) ☆264 Updated 6 months ago
- Distributed SQL Query Engine in Python using Ray ☆246 Updated last year
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion ☆221 Updated this week
- A native Delta implementation for integration with any query engine ☆271 Updated this week
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics! ☆139 Updated 3 weeks ago
- Pure Rust Iceberg Implementation ☆162 Updated last year
- Template for DuckDB extensions to help you develop, test and deploy a custom extension ☆229 Updated 2 weeks ago
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible. ☆151 Updated last month
- ☆33 Updated 5 months ago
- Apache DataFusion Benchmarks ☆22 Updated 3 weeks ago
- View parquet files online ☆188 Updated last week
- DataFusion TableProviders for reading data from other systems ☆152 Updated this week
- JSON support for DataFusion (unofficial) ☆48 Updated last month
- Boring Data Tool ☆236 Updated last year
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings. ☆255 Updated this week
- Batteries included CLI, TUI, and server implementations for DataFusion. ☆165 Updated this week
- A purely experimental DuckDB Deltalake extension ☆95 Updated last week
- A BYOC option for Snowflake workloads ☆101 Updated this week
- ☆48 Updated last month
- Apache Iceberg C++ ☆147 Updated this week
- Ibis Substrait Compiler ☆105 Updated this week
- Database connectivity API standard and libraries for Apache Arrow ☆495 Updated this week
- Helpers for Arrow C Data & Arrow C Stream interfaces ☆208 Updated 2 weeks ago
- ☆14 Updated 2 years ago
- Distributed pushdown cache for DataFusion ☆298 Updated this week
- New file format for storage of large columnar datasets. ☆626 Updated this week