apache / parquet-testingLinks
Apache Parquet Testing
☆78Updated last month
Alternatives and similar repositories for parquet-testing
Users that are interested in parquet-testing are comparing it to the libraries listed below
Sorting:
- Compaction runtime for Apache Iceberg.☆113Updated last week
- TPC-H benchmark data generation in pure Rust☆216Updated this week
- ☆53Updated last week
- Apache DataFusion Ray☆227Updated 2 months ago
- ☆33Updated 7 months ago
- Template for DuckDB extensions to help you develop, test and deploy a custom extension☆250Updated last week
- A native Delta implementation for integration with any query engine☆305Updated last week
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆273Updated 8 months ago
- Distributed SQL Query Engine in Python using Ray☆246Updated last year
- Experimental version. A BYOC option for Snowflake workloads☆101Updated last week
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆229Updated last week
- Pure Rust Iceberg Implementation☆162Updated last year
- Apache DataFusion Benchmarks☆22Updated 2 weeks ago
- Batteries included CLI, TUI, and server implementations for DataFusion.☆184Updated last month
- Apache Arrow Flight SQL adapter for PostgreSQL☆102Updated last week
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆144Updated 3 months ago
- Apache Iceberg C++☆170Updated this week
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆265Updated last week
- Boring Data Tool☆239Updated last year
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB☆36Updated 3 years ago
- DataFusion TableProviders for reading data from other systems☆162Updated this week
- A purely experimental DuckDB Deltalake extension☆95Updated last week
- Query Plan Markup Language☆45Updated last year
- Helpers for Arrow C Data & Arrow C Stream interfaces☆216Updated last week
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆144Updated 4 months ago
- Serverless query engine☆141Updated 2 years ago
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆158Updated 3 weeks ago
- ☆66Updated last week
- [VLDB 2023 Vol 17] "An Empirical Evaluation of Columnar Storage Formats"☆68Updated 2 months ago
- In-Memory Analytics with Apache Arrow, published by Packt☆104Updated 2 weeks ago