apache / parquet-testingLinks
Apache Parquet Testing
☆68Updated last week
Alternatives and similar repositories for parquet-testing
Users that are interested in parquet-testing are comparing it to the libraries listed below
Sorting:
- TPC-H benchmark data generation in pure Rust☆131Updated this week
- Comptaction runtime for Apache Iceberg.☆67Updated this week
- ☆47Updated last month
- Apache DataFusion Ray☆217Updated 2 weeks ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆251Updated 4 months ago
- Template for DuckDB extensions to help you develop, test and deploy a custom extension☆213Updated 3 weeks ago
- A native Delta implementation for integration with any query engine☆246Updated last week
- Pure Rust Iceberg Implementation☆162Updated last year
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆136Updated 3 weeks ago
- Distributed SQL Query Engine in Python using Ray☆244Updated 10 months ago
- ☆33Updated 3 months ago
- Batteries included CLI, TUI, and server implementations for DataFusion.☆163Updated last month
- New file format for storage of large columnar datasets.☆586Updated 2 weeks ago
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆214Updated last week
- Apache Arrow Flight SQL adapter for PostgreSQL☆95Updated last week
- A BYOC option for Snowflake workloads☆88Updated this week
- Boring Data Tool☆226Updated last year
- JSON support for DataFusion (unofficial)☆46Updated 2 weeks ago
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆141Updated 2 weeks ago
- Database connectivity API standard and libraries for Apache Arrow☆477Updated this week
- A purely experimental DuckDB Deltalake extension☆95Updated last week
- DataFusion TableProviders for reading data from other systems☆136Updated this week
- View parquet files online☆172Updated 3 weeks ago
- Query Plan Markup Language☆45Updated last year
- In-Memory Analytics with Apache Arrow, published by Packt☆103Updated last year
- Code repo for "An Empirical Evaluation of Columnar Storage Formats" VLDB Vol 17☆62Updated last year
- Apache DataFusion Benchmarks☆20Updated 4 months ago
- ☆302Updated last week
- Distributed pushdown cache for DataFusion☆239Updated last week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆267Updated 10 months ago