apache / parquet-testing
Apache Parquet Testing
☆53Updated last week
Alternatives and similar repositories for parquet-testing:
Users that are interested in parquet-testing are comparing it to the libraries listed below
- ☆36Updated this week
- Apache DataFusion Ray☆170Updated this week
- A purely experimental DuckDB Deltalake extension☆95Updated this week
- Template for DuckDB extensions to help you develop, test and deploy a custom extension☆176Updated this week
- Query Plan Markup Language☆45Updated last year
- ☆33Updated 2 years ago
- Apache Arrow Flight SQL adapter for PostgreSQL☆77Updated 2 months ago
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆121Updated 2 months ago
- Rust implementation of Apache Iceberg with integration for Datafusion☆151Updated this week
- Pure Rust Iceberg Implementation☆163Updated 7 months ago
- Batteries included CLI, TUI, and server implementations for DataFusion.☆137Updated this week
- JSON support for DataFusion (unofficial)☆37Updated last week
- A native Delta implementation for integration with any query engine☆197Updated this week
- ☆82Updated this week
- Distributed SQL Query Engine in Python using Ray☆244Updated 5 months ago
- Ibis Substrait Compiler☆99Updated this week
- A User-Defined Function Framework for Apache Arrow.☆87Updated last week
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆104Updated this week
- Embeddable Aggregate Management System for Streams and Queries.☆91Updated 2 months ago
- Boring Data Tool☆214Updated 11 months ago
- ☆44Updated last week
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB☆31Updated 2 years ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆236Updated 10 months ago
- DataFusion TableProviders for reading data from other systems☆89Updated this week
- Helpers for Arrow C Data & Arrow C Stream interfaces☆183Updated this week
- ☆105Updated last year
- Experimental support for serializing DataFusion plans using substrait☆45Updated 2 years ago
- Official Java implementation of Apache Arrow☆34Updated this week
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆104Updated 3 weeks ago
- ☆14Updated 2 years ago