☆97Updated this week
Alternatives and similar repositories for substrait-java
Users that are interested in substrait-java are comparing it to the libraries listed below
Sorting:
- Ibis Substrait Compiler☆109Feb 20, 2026Updated last week
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,516Updated this week
- ☆58Feb 12, 2026Updated 2 weeks ago
- Apache Arrow Cookbook☆107Jan 1, 2026Updated last month
- The (B)ig (F)unction (T)axonomy is a detailed reference for common compute functions executed by different libraries, databases, and tool…☆18Dec 12, 2024Updated last year
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆257Feb 21, 2023Updated 3 years ago
- ☆49Feb 14, 2022Updated 4 years ago
- Community Java bindings for https://github.com/facebookincubator/velox☆39Updated this week
- Java binding to Apache DataFusion☆84Apr 14, 2025Updated 10 months ago
- This is the companion repository for the book How Query Engines Work.☆426Jan 25, 2026Updated last month
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆46Dec 14, 2025Updated 2 months ago
- Official Java implementation of Apache Arrow☆83Updated this week
- A modular acceleration toolkit for big data analytic engines☆67May 6, 2024Updated last year
- Java JNI interface to the TileDB Arrays storage and query engine☆26Jan 24, 2026Updated last month
- ☆11Feb 16, 2026Updated last week
- Apache Calcite Avatica☆268Feb 11, 2026Updated 2 weeks ago
- Spark integrations for working with Lance datasets☆44Updated this week
- Vectorized processing for Apache Arrow☆484Feb 14, 2022Updated 4 years ago
- A composable and fully extensible C++ execution engine library for data management systems.☆4,065Updated this week
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆303Oct 30, 2025Updated 3 months ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆888Feb 9, 2026Updated 2 weeks ago
- Tools for generating TPC-* datasets☆31Jun 23, 2024Updated last year
- Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.☆1,039Feb 19, 2026Updated last week
- QTag: Turbocharge Your SQL Comments☆12Jan 30, 2025Updated last year
- Apache Calcite☆5,077Updated this week
- Database connectivity API standard and libraries for Apache Arrow☆556Updated this week
- Spark-Radiant is Apache Spark Performance and Cost Optimizer☆25Dec 31, 2024Updated last year
- calcite-arrow-sample(WIP)☆13Dec 17, 2017Updated 8 years ago
- Sketch Library for vector-based models☆15Mar 30, 2025Updated 10 months ago
- Atomix Jepsen tests☆14Feb 7, 2017Updated 9 years ago
- Apache DataFusion Ballista Distributed Query Engine☆1,977Updated this week
- ## Auto-archived due to inactivity. ## Simple JVM Profiler Using StatsD and Other Metrics Backends☆15Oct 3, 2023Updated 2 years ago
- Statistics about data (cardinality estimation, frequent item detection, approximate counting,...)☆16Jun 21, 2022Updated 3 years ago
- Fybrik platform - Arrow/Flight module☆15Aug 10, 2024Updated last year
- End-to-end SQL fuzz testing for DataFusion using SQLancer☆12Feb 9, 2026Updated 2 weeks ago
- Quick starts for Teiid WildFly☆25Apr 3, 2019Updated 6 years ago
- Append-only key-value database on a distributed shared-log☆52Aug 14, 2024Updated last year
- ☆19Updated this week
- A query predictor pipeline and service to predict resource usages of Presto queries☆15May 2, 2023Updated 2 years ago