qwshen/spark-flight-connector

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/qwshen/spark-flight-connector)

qwshen / spark-flight-connector

A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL

☆49

Alternatives and similar repositories for spark-flight-connector

Users that are interested in spark-flight-connector are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rymurr / flight-spark-source
View on GitHub
☆109Jul 5, 2023Updated 3 years ago
voltrondata / spark-substrait-gateway
View on GitHub
Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).
☆19Feb 10, 2025Updated last year
lakekeeper / console
View on GitHub
A leightweight UI for Lakekeeper
☆19Updated this week
dashbook / dashtool
View on GitHub
☆17Nov 27, 2025Updated 8 months ago
tustvold / access-log-bench
View on GitHub
☆14Dec 8, 2022Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
lakekeeper / lakekeeper-charts
View on GitHub
Helm chart for Lakekeeper - a Rust Native Iceberg REST Catalog
☆25Jul 20, 2026Updated last week
dremio / flightsql-odbc
View on GitHub
☆15Apr 28, 2026Updated 3 months ago
fybrik / arrow-flight-module
View on GitHub
Fybrik platform - Arrow/Flight module
☆15Aug 10, 2024Updated last year
markhoerth / dremio_client
View on GitHub
☆31Mar 14, 2022Updated 4 years ago
lancedb / flight-sql-js-client
View on GitHub
A JavaScript client for FlightSQL
☆17Nov 14, 2025Updated 8 months ago
qwshen / spark-etl-framework
View on GitHub
A generic ETL framework with Spark_SQL for transforming data by constructing pipelines with Yaml/Json/Xml
☆21Feb 3, 2026Updated 5 months ago
narendrans / sqlalchemy_dremio
View on GitHub
SQLAlchemy for Dremio via the ODBC and Flight interface.
☆30Mar 12, 2026Updated 4 months ago
fabrice-etanchaud / dbt-dremio
View on GitHub
dbt's adapter for dremio
☆48Oct 15, 2022Updated 3 years ago
influxdata / flightsql-dbapi
View on GitHub
DB API 2 interface for Flight SQL with SQLAlchemy extras.
☆44Sep 30, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
rajagurunath / lakehouse-sharing
View on GitHub
A Table format agnostic data sharing framework
☆42Feb 4, 2024Updated 2 years ago
substrait-io / substrait-java
View on GitHub
☆101Updated this week
quackscience / quackflight
View on GitHub
DuckDB API Server with Arrow Flight SQL Airport support and concurrent writes/reads (quackpipe)
☆124Mar 5, 2025Updated last year
JanKaul / frostbow
View on GitHub
☆15Jun 1, 2026Updated last month
PastorGL / datacooker-etl
View on GitHub
ETL processing toolset with SQL-like language and GIS capabilities, built on core Spark. Extensible and modular. REPL included
☆16Jun 12, 2026Updated last month
JanKaul / iceberg-rust
View on GitHub
Unofficial rust implementation of Apache Iceberg with integration for Datafusion
☆241Updated this week
substrait-io / duckdb-substrait-extension
View on GitHub
☆66Updated this week
apache / datafusion-comet
View on GitHub
Apache DataFusion Comet Spark Accelerator
☆1,233Updated this week
oliverdaff / iceberg-rs
View on GitHub
☆34May 9, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
BryanCutler / SparkArrowFlight
View on GitHub
Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients
☆37Mar 9, 2021Updated 5 years ago
QuantStack / glue-jupyterlab
View on GitHub
Glue JupyterLab Extension
☆20Jul 6, 2026Updated 3 weeks ago
mustafaakin / big-data-docker
View on GitHub
Better, container friendly big-data images for Docker
☆38Nov 12, 2016Updated 9 years ago
ddossot / vertx-react-demo
View on GitHub
Vert.x React Demo
☆14May 18, 2014Updated 12 years ago
kitaisreal / hash-table-aggregation-benchmark
View on GitHub
☆12Mar 14, 2024Updated 2 years ago
gizmodata / quack-jdbc
View on GitHub
JDBC driver for DuckDB's Quack remote protocol (quack:// URI scheme). Lets any JVM tool query a remote DuckDB server over HTTP.
☆15Updated this week
2gis / kafka-connect-hdfs-ext
View on GitHub
Set of extensions for kafka connect hdfs
☆11May 12, 2021Updated 5 years ago
jeremychone / rust-simple-fs
View on GitHub
Simple and convenient API for File System access
☆21Jul 7, 2026Updated 3 weeks ago
nfnty / pkgbuilds
View on GitHub
Personal PKGBUILDs
☆17May 12, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
maropu / spark-sql-flow-plugin
View on GitHub
Visualize column-level data lineage in Spark SQL
☆92May 13, 2022Updated 4 years ago
jnv / ansible-role-debian-backports
View on GitHub
Setup backports repository for Debian and Ubuntu
☆11May 23, 2022Updated 4 years ago
projectnessie / nessie-demos
View on GitHub
Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.
☆32Updated this week
soniahorchidan / crayfish23
View on GitHub
Benchmarking Machine Learning Model Inference in Data Streaming Solutions
☆10Jun 12, 2024Updated 2 years ago
Query-farm / adbc_scanner
View on GitHub
A DuckDB ADBC Scanner Extension - adds support for using ADBC drivers with DuckDB as a client.
☆18Updated this week
sky-uk / kfp-operator
View on GitHub
☆20Updated this week
devork / geom
View on GitHub
General geometry parsing for binary and text formats
☆12Sep 23, 2017Updated 8 years ago