substrait-io/substrait-java

☆101

Alternatives and similar repositories for substrait-java

Users who are interested in substrait-java are comparing it to the libraries listed below.

ibis-project / ibis-substrait
Ibis Substrait Compiler
☆111 · Updated this week
substrait-io / substrait
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
☆1,535 · Updated this week
substrait-io / substrait-python
☆24 · Updated this week
apache / gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
☆1,576 · Updated this week
voltrondata / substrait-r
An R Interface to the 'Substrait' Cross-Language Serialization for Relational Algebra
☆28 · Apr 21, 2023 · Updated 3 years ago
apache / arrow-cookbook
Apache Arrow Cookbook
☆107 · Jul 10, 2026 · Updated last week
substrait-io / bft
The (B)ig (F)unction (T)axonomy is a detailed reference for common compute functions executed by different libraries, databases, and tool…
☆18 · Dec 12, 2024 · Updated last year
substrait-io / substrait-cpp
☆17 · Apr 10, 2026 · Updated 3 months ago
substrait-io / substrait-validator
☆15 · Updated this week
andygrove / how-query-engines-work
This is the companion repository for the book How Query Engines Work.
☆455 · Jan 25, 2026 · Updated 5 months ago
Stream-SQL-TCK / Stream-SQL-TCK
☆13 · Mar 2, 2018 · Updated 8 years ago
qwshen / spark-flight-connector
A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL
☆49 · Jun 7, 2026 · Updated last month
intel / BDTK
A modular acceleration toolkit for big data analytic engines
☆66 · May 6, 2024 · Updated 2 years ago
zabetak / calcite-tutorial
☆49 · Feb 14, 2022 · Updated 4 years ago
oap-project / gazelle_plugin
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
☆255 · Feb 21, 2023 · Updated 3 years ago
xskipper-io / xskipper
An Extensible Data Skipping Framework
☆50 · Jul 15, 2025 · Updated last year
facebookincubator / velox
A composable and fully extensible C++ execution engine library for data management systems.
☆4,172 · Updated this week
lance-format / lance-spark
Spark integrations for working with Lance datasets
☆60 · Updated this week
apache / celeborn
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
☆1,056 · Updated this week
rymurr / flight-spark-source
☆109 · Jul 5, 2023 · Updated 3 years ago
apache / calcite
Apache Calcite
☆5,157 · Updated this week
apache / datafusion-ballista
Apache DataFusion Ballista Distributed Query Engine
☆2,091 · Updated this week
boostscale / velox4j
Community Java bindings for https://github.com/facebookincubator/velox
☆43 · Updated this week
direct-spark-sql / direct-spark-sql
A hyper-optimized single-node (local) version of the Spark SQL engine, whose fundamental data structure is the Scala Iterator rather than the RDD.
☆13 · Jun 13, 2023 · Updated 3 years ago
apache / datafusion-comet
Apache DataFusion Comet Spark Accelerator
☆1,230 · Updated this week
projectnessie / nessie-demos
Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.
☆32 · Updated this week
0x0L / pgeon
Apache Arrow PostgreSQL connector
☆64 · Feb 12, 2024 · Updated 2 years ago
linkedin / coral
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
☆907 · Updated this week
datafusion-contrib / tpctools
Tools for generating TPC-* datasets
☆33 · Jun 23, 2024 · Updated 2 years ago
prestodb / presto-query-predictor
A query predictor pipeline and service to predict resource usages of Presto queries
☆14 · May 2, 2023 · Updated 3 years ago
datafusion-contrib / datafusion-java
Java binding to Apache DataFusion
☆87 · May 4, 2026 · Updated 2 months ago
snowflakedb / snowflake-hive-metastore-connector
☆13 · Updated this week
apache / auron
The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process…
☆1,778 · Updated this week
apache / calcite-avatica
Apache Calcite Avatica
☆270 · Jun 24, 2026 · Updated 3 weeks ago
zrlio / albis
Albis: High-Performance File Format for Big Data Systems
☆21 · Jul 12, 2018 · Updated 8 years ago
datafusion-contrib / datafusion-tokomak
Optimizer for DataFusion based on the egg framework
☆16 · Mar 17, 2022 · Updated 4 years ago
linkedin / transport
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…
☆306 · Jun 29, 2026 · Updated 2 weeks ago
apache / arrow-flight-sql-postgresql
Apache Arrow Flight SQL adapter for PostgreSQL
☆111 · Jun 22, 2026 · Updated 3 weeks ago
voltrondata / spark-substrait-gateway
Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).
☆19 · Feb 10, 2025 · Updated last year