voltrondata/spark-substrait-gateway

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/voltrondata/spark-substrait-gateway)

voltrondata / spark-substrait-gateway

Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).

☆19

Alternatives and similar repositories for spark-substrait-gateway

Users that are interested in spark-substrait-gateway are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Query-farm / adbc_scanner
View on GitHub
A DuckDB ADBC Scanner Extension - adds support for using ADBC drivers with DuckDB as a client.
☆18Updated this week
apache / datafusion-benchmarks
View on GitHub
Apache DataFusion Benchmarks
☆23May 2, 2026Updated 2 months ago
substrait-io / duckdb-substrait-extension
View on GitHub
☆66Updated this week
qwshen / spark-flight-connector
View on GitHub
A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL
☆49Jun 7, 2026Updated last month
tokoko / SparkFlightSql
View on GitHub
☆10Jun 23, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
gizmodata / flight-ibis-demo
View on GitHub
This repo demonstrates an Apache Arrow Flight server implementation in Kubernetes.
☆12Oct 25, 2024Updated last year
mebauer / ibis-basics
View on GitHub
Ibis Basics: An Introduction to the Portable Python Dataframe Library.
☆10Jun 6, 2024Updated 2 years ago
Kimahriman / hdfs-native
View on GitHub
☆76Updated this week
amsterdata / schemapile
View on GitHub
☆12Jul 8, 2024Updated 2 years ago
dremio / iceberg-auth-manager
View on GitHub
Dremio AuthManager for Apache Iceberg
☆15Jun 8, 2026Updated last month
substrait-io / substrait-python
View on GitHub
☆24Updated this week
oap-project / sql-ds-cache
View on GitHub
Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.
☆37Jan 3, 2023Updated 3 years ago
red-data-tools / red-arrow-duckdb
View on GitHub
A library that provides Apache Arrow support to ruby-duckdb
☆14Jun 22, 2026Updated last month
lancedb / flight-sql-js-client
View on GitHub
A JavaScript client for FlightSQL
☆17Nov 14, 2025Updated 8 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
andizimmerer / imlab-dremel
View on GitHub
Implementation of Google Dremel's storage engine in a custom in-memory DB with query compilation.
☆14Oct 10, 2020Updated 5 years ago
kitaisreal / hash-table-aggregation-benchmark
View on GitHub
☆12Mar 14, 2024Updated 2 years ago
gizmodata / quack-jdbc
View on GitHub
JDBC driver for DuckDB's Quack remote protocol (quack:// URI scheme). Lets any JVM tool query a remote DuckDB server over HTTP.
☆15Updated this week
elephaint / pedpf
View on GitHub
Parameter Efficient Deep Probabilistic Forecasting
☆14Jan 8, 2022Updated 4 years ago
soniahorchidan / crayfish23
View on GitHub
Benchmarking Machine Learning Model Inference in Data Streaming Solutions
☆10Jun 12, 2024Updated 2 years ago
databricks / congruity
View on GitHub
The goal of this library is to provide a compatibility layer that makes it easier to adopt Spark Connect. The library is designed to be s…
☆18Nov 25, 2024Updated last year
t1mm3 / vldb_voila
View on GitHub
Query engine synthesizer based on, our domain-specific language, VOILA
☆14Mar 2, 2021Updated 5 years ago
Wal33D / nhtsa-vin-decoder
View on GitHub
Official NHTSA vPIC API wrapper with offline WMI database fallback. Decodes VINs using government data for complete vehicle specs. Featur…
☆19Oct 23, 2025Updated 9 months ago
apache / arrow-testing
View on GitHub
Auxiliary testing files for Apache Arrow
☆18Jun 23, 2026Updated last month
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
soda-inria / retrieve-merge-predict
View on GitHub
☆12Mar 6, 2026Updated 4 months ago
linkedin / spark
View on GitHub
Apache Spark - A unified analytics engine for large-scale data processing
☆16Jul 24, 2023Updated 3 years ago
stefan-grafberger / mlinspect
View on GitHub
Inspect ML Pipelines in Python in the form of a DAG
☆70Feb 24, 2024Updated 2 years ago
duckdblabs / duckdb-substrait-demo
View on GitHub
☆17Jan 17, 2023Updated 3 years ago
alexmalins / harlequin-databricks
View on GitHub
☆12Dec 19, 2025Updated 7 months ago
naomijub / JVM-rust-ffi
View on GitHub
☆22Jun 6, 2022Updated 4 years ago
intel / PerTaskMemBWMonitoring
View on GitHub
☆11Jan 7, 2023Updated 3 years ago
paleolimbot / duckdb-nanoarrow
View on GitHub
☆75Mar 12, 2026Updated 4 months ago
substrait-io / substrait-validator
View on GitHub
☆15Jul 21, 2026Updated last week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
a49a / bigdata-sql-benchmark
View on GitHub
Flink, Presto, Trino TPC-DS benchmark
☆16Feb 20, 2023Updated 3 years ago
KevinLee1110 / dynamic-batching
View on GitHub
The official repo for the paper "Optimizing LLM Inference Throughput via Memory-aware and SLA-constrained Dynamic Batching"
☆18Mar 17, 2025Updated last year
lightdash / dbt-docs-95
View on GitHub
dbt docs but windows 95
☆16Jun 7, 2022Updated 4 years ago
gizmodata / spark-connect-proxy
View on GitHub
A reverse proxy server which allows secure connectivity to a Spark Connect server
☆16Aug 13, 2025Updated 11 months ago
manifesto-ai / core-legacy
View on GitHub
AI-Native Semantic State Layer
☆18Jan 5, 2026Updated 6 months ago
gizmodata / adbc-driver-quack
View on GitHub
Go ADBC driver for DuckDB's Quack remote protocol (quack:// URI scheme). Returns Apache Arrow RecordBatches; supports bulk-ingest via APP…
☆28Updated this week
hpides / autovec-db
View on GitHub
Code for our paper "Evaluating SIMD Compiler-Intrinsics for Database Systems"
☆16Jul 5, 2023Updated 3 years ago