Apache DataFusion Benchmarks
☆22Mar 3, 2026Updated 2 weeks ago
Alternatives and similar repositories for datafusion-benchmarks
Users that are interested in datafusion-benchmarks are comparing it to the libraries listed below
Sorting:
- TPC-H benchmark data generation in pure Rust☆233Mar 11, 2026Updated last week
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).☆20Feb 10, 2025Updated last year
- HDFS based on Java implementation as a remote ObjectStore for DataFusion☆10Feb 13, 2024Updated 2 years ago
- Rust object_store crate☆230Mar 12, 2026Updated last week
- Collection of AWS Lambdas for creating and managing Delta tables☆57Updated this week
- JSON support for DataFusion (unofficial)☆55Mar 11, 2026Updated last week
- Python bindings and arrow integration for the rust object_store crate.☆65Aug 5, 2024Updated last year
- Interpreter for a small subset of the Haskell programming language☆16Dec 11, 2025Updated 3 months ago
- ☆11Mar 14, 2024Updated 2 years ago
- End-to-end SQL fuzz testing for DataFusion using SQLancer☆12Feb 9, 2026Updated last month
- Community Java bindings for https://github.com/facebookincubator/velox☆40Updated this week
- 🚀A fast, stable and embedded k-v database in pure Golang, supports string, list, hash, set, sorted set. 一个 Go 语言实现的快速、稳定、内嵌的 k-v 数据库。☆10Jun 23, 2021Updated 4 years ago
- Port of TPC-DS dsdgen to Java☆22Nov 29, 2022Updated 3 years ago
- Rust based high-performance Apache Uniffle shuffle-server☆62Feb 28, 2026Updated 2 weeks ago
- YTsaurus SPYT provides an integration with Apache Spark☆19Updated this week
- ☆23Jan 23, 2022Updated 4 years ago
- ☆12Mar 6, 2026Updated 2 weeks ago
- ☆19Dec 1, 2025Updated 3 months ago
- InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution.☆55May 13, 2024Updated last year
- A Python Snowpark CLI for loading the TPC-DI dataset into Snowflake. Additional dbt models for building the data warehouse.☆10Sep 4, 2025Updated 6 months ago
- ☆12Jan 7, 2023Updated 3 years ago
- ☆18Jan 12, 2026Updated 2 months ago
- A purely experimental DuckDB Deltalake extension☆95Mar 13, 2026Updated last week
- Python Package for ducklake☆20Jun 5, 2025Updated 9 months ago
- Tools for generating TPC-* datasets☆31Jun 23, 2024Updated last year
- A Persistent Key-Value Store designed for Streaming processing☆120Jan 13, 2026Updated 2 months ago
- ☆17Sep 20, 2021Updated 4 years ago
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆270Updated this week
- The source for REST API specifications for Microsoft Fabric.☆35Feb 3, 2026Updated last month
- Flink, Presto, Trino TPC-DS benchmark☆15Feb 20, 2023Updated 3 years ago
- A no code tool to help build pivot tables from any database with a few clicks.☆24Dec 12, 2025Updated 3 months ago
- ☆12Jan 31, 2021Updated 5 years ago
- The Almaren Framework provides a simplified consistent minimalistic layer over Apache Spark. While still allowing you to take advantage o …☆31Jun 18, 2025Updated 9 months ago
- Tools to sort, merge, write, and read immutable key-value pairs☆26Sep 18, 2025Updated 6 months ago
- Your SQL database for learning purpose☆78Sep 13, 2025Updated 6 months ago
- A QA RAG system that uses a custom chromadb to retrieve relevant passages and then uses an LLM to generate the answer.☆17Feb 28, 2024Updated 2 years ago
- Code for our paper "Evaluating SIMD Compiler-Intrinsics for Database Systems"☆16Jul 5, 2023Updated 2 years ago
- Olympia is a storage-only open catalog format for big data analytics, ML & AI.☆16May 5, 2025Updated 10 months ago
- A Rust/Python library for fast avro deserialization☆12Aug 25, 2024Updated last year