Apache DataFusion Benchmarks
☆23Mar 3, 2026Updated last month
Alternatives and similar repositories for datafusion-benchmarks
Users that are interested in datafusion-benchmarks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TPC-H benchmark data generation in pure Rust☆240Apr 20, 2026Updated last week
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).☆19Feb 10, 2025Updated last year
- HDFS based on Java implementation as a remote ObjectStore for DataFusion☆10Feb 13, 2024Updated 2 years ago
- Rust object_store crate☆241Updated this week
- pg_linux_stats provides information similar to the Linux commands vmstat, iostat, netstat and mpstat via PostgreSQL.☆16Jan 14, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Collection of AWS Lambdas for creating and managing Delta tables☆57Updated this week
- JSON support for DataFusion (unofficial)☆56Updated this week
- Python bindings and arrow integration for the rust object_store crate.☆66Aug 5, 2024Updated last year
- SQL for Kubernetes resources☆39Sep 17, 2025Updated 7 months ago
- Interpreter for a small subset of the Haskell programming language☆16Apr 19, 2026Updated last week
- ☆12Mar 14, 2024Updated 2 years ago
- (Archived) End-to-end SQL fuzz testing for DataFusion using SQLancer☆13Apr 16, 2026Updated last week
- Community Java bindings for https://github.com/facebookincubator/velox☆41Updated this week
- Port of TPC-DS dsdgen to Java☆22Nov 29, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Rust based high-performance Apache Uniffle shuffle-server☆64Updated this week
- YTsaurus SPYT provides an integration with Apache Spark☆19Updated this week
- ☆10Nov 20, 2014Updated 11 years ago
- ☆23Jan 23, 2022Updated 4 years ago
- A terraform module that deploys Dagster to Azure.☆11May 10, 2021Updated 4 years ago
- ☆19Dec 1, 2025Updated 4 months ago
- InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution.☆55May 13, 2024Updated last year
- A Python Snowpark CLI for loading the TPC-DI dataset into Snowflake. Additional dbt models for building the data warehouse.☆10Sep 4, 2025Updated 7 months ago
- ☆12Jan 7, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆18Jan 12, 2026Updated 3 months ago
- A purely experimental DuckDB Deltalake extension☆95Apr 23, 2026Updated last week
- Tools for generating TPC-* datasets☆31Jun 23, 2024Updated last year
- ☆22Jun 6, 2022Updated 3 years ago
- Towards Sound Reassembly of Modern x86-64 Binaries (ASPLOS'25)☆21Apr 1, 2025Updated last year
- A Persistent Key-Value Store designed for Streaming processing☆121Jan 13, 2026Updated 3 months ago
- ☆17Sep 20, 2021Updated 4 years ago
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆273Updated this week
- The source for REST API specifications for Microsoft Fabric.☆40Apr 21, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Flink, Presto, Trino TPC-DS benchmark☆16Feb 20, 2023Updated 3 years ago
- A no code tool to help build pivot tables from any database with a few clicks.☆24Apr 14, 2026Updated 2 weeks ago
- ☆12Jan 31, 2021Updated 5 years ago
- The Almaren Framework provides a simplified consistent minimalistic layer over Apache Spark. While still allowing you to take advantage o…☆31Jun 18, 2025Updated 10 months ago
- performance counters in C++☆28Updated this week
- Data Engineering framework written in Python based in Polars.☆14May 1, 2024Updated last year
- A QA RAG system that uses a custom chromadb to retrieve relevant passages and then uses an LLM to generate the answer.☆16Feb 28, 2024Updated 2 years ago