apache / parquet-testing
Apache Parquet Testing
☆54Updated 2 weeks ago
Alternatives and similar repositories for parquet-testing:
Users that are interested in parquet-testing are comparing it to the libraries listed below
- ☆37Updated last week
- Query Plan Markup Language☆45Updated last year
- Template for DuckDB extensions to help you develop, test and deploy a custom extension☆179Updated last week
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆121Updated last week
- Apache DataFusion Ray☆169Updated last week
- ☆26Updated this week
- A purely experimental DuckDB Deltalake extension☆95Updated this week
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆238Updated 10 months ago
- Extension for DuckDB for functions that require the Apache Arrow dependency☆39Updated last month
- A native Delta implementation for integration with any query engine☆206Updated this week
- ☆82Updated this week
- ☆25Updated 2 weeks ago
- Boring Data Tool☆214Updated last year
- Distributed SQL Query Engine in Python using Ray☆244Updated 5 months ago
- In-Memory Analytics with Apache Arrow, published by Packt☆97Updated last year
- Prototype compiler from SaneQL to SQL☆81Updated last year
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆105Updated last month
- Official Go implementation of Apache Arrow☆125Updated this week
- ☆105Updated last year
- ☆231Updated this week
- Apache Iceberg C++☆51Updated this week
- Sample code to accompany blog post showcasing Arrow Flight SQL running on DuckDB☆32Updated 2 years ago
- Rust implementation of Apache Iceberg with integration for Datafusion☆153Updated this week
- Ibis Substrait Compiler☆100Updated this week
- JSON support for DataFusion (unofficial)☆38Updated last week
- Pure Rust Iceberg Implementation☆163Updated 7 months ago
- Strategically Deconstruct UDFs with PRISM for Faster Query Plans☆14Updated 4 months ago
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆106Updated last week
- Apache Arrow Cookbook☆101Updated last month
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).