apache / parquet-testingLinks
Apache Parquet Testing
☆73Updated last month
Alternatives and similar repositories for parquet-testing
Users that are interested in parquet-testing are comparing it to the libraries listed below
Sorting:
- TPC-H benchmark data generation in pure Rust☆180Updated 3 weeks ago
- Comptaction runtime for Apache Iceberg.☆87Updated last week
- ☆48Updated 3 months ago
- Apache DataFusion Ray☆219Updated last month
- Template for DuckDB extensions to help you develop, test and deploy a custom extension☆222Updated 2 weeks ago
- A native Delta implementation for integration with any query engine☆271Updated this week
- Pure Rust Iceberg Implementation☆162Updated last year
- Distributed SQL Query Engine in Python using Ray☆244Updated 11 months ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆260Updated 5 months ago
- Database connectivity API standard and libraries for Apache Arrow☆486Updated this week
- Apache DataFusion Benchmarks☆21Updated 5 months ago
- A purely experimental DuckDB Deltalake extension☆95Updated this week
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆217Updated this week
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆138Updated last week
- ☆33Updated 4 months ago
- A BYOC option for Snowflake workloads☆101Updated this week
- DataFusion TableProviders for reading data from other systems☆147Updated this week
- This repository is made as read-only filesystem for remote access.☆98Updated last week
- ☆323Updated last week
- Apache Arrow Flight SQL adapter for PostgreSQL☆95Updated 3 weeks ago
- ☆46Updated 2 weeks ago
- Boring Data Tool☆235Updated last year
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆253Updated this week
- New file format for storage of large columnar datasets.☆612Updated last week
- Simple & Real-Time Ingestion into Apache Iceberg.☆176Updated this week
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆149Updated last week
- Helpers for Arrow C Data & Arrow C Stream interfaces☆206Updated last week
- Batteries included CLI, TUI, and server implementations for DataFusion.☆165Updated 3 months ago
- JSON support for DataFusion (unofficial)☆48Updated last week
- Apache Iceberg C++☆143Updated this week