A Spark connector that reads data from and writes data to Arrow Flight endpoints, using Arrow Flight and Flight SQL
☆46Dec 14, 2025Updated 2 months ago
Alternatives and similar repositories for spark-flight-connector
Users who are interested in spark-flight-connector are comparing it to the libraries listed below.
- ☆108Jul 5, 2023Updated 2 years ago
- A lightweight UI for Lakekeeper☆16Mar 2, 2026Updated last week
- ☆14Dec 8, 2022Updated 3 years ago
- Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).☆20Feb 10, 2025Updated last year
- Fybrik platform - Arrow/Flight module☆15Aug 10, 2024Updated last year
- DB API 2 interface for Flight SQL with SQLAlchemy extras.☆43Sep 30, 2025Updated 5 months ago
- A Table format agnostic data sharing framework☆42Feb 4, 2024Updated 2 years ago
- My custom Python project scaffolding repository.☆12Aug 5, 2025Updated 7 months ago
- SQLAlchemy for Dremio via the ODBC and Flight interface.☆30Jan 8, 2026Updated 2 months ago
- Spark-Radiant is an Apache Spark performance and cost optimizer☆25Dec 31, 2024Updated last year
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated this week
- ☆16Nov 27, 2025Updated 3 months ago
- ☆14Feb 13, 2026Updated 3 weeks ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆37Mar 9, 2021Updated 4 years ago
- ☆21Aug 26, 2025Updated 6 months ago
- ☆18Updated this week
- ETL processing toolset with SQL-like language and GIS capabilities, built on core Spark. Extensible and modular. REPL included☆16Jan 26, 2026Updated last month
- dbt's adapter for dremio☆48Oct 15, 2022Updated 3 years ago
- ☆22Jun 6, 2022Updated 3 years ago
- Open, Multi-modal Catalog for Data & AI, written in Rust☆86Sep 30, 2024Updated last year
- Glue JupyterLab Extension☆20Updated this week
- ☆23May 2, 2024Updated last year
- ☆97Updated this week
- ☆59Feb 12, 2026Updated 3 weeks ago
- Visualize column-level data lineage in Spark SQL☆92May 13, 2022Updated 3 years ago
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆233Feb 27, 2026Updated last week
- TSG Client is a Python library for interacting with the TNO Security Gateway (TSG) Core Container☆18Mar 28, 2025Updated 11 months ago
- spark-sight: Spark performance at a glance☆10Apr 6, 2023Updated 2 years ago
- Query and transform data with PRQL☆137Sep 23, 2023Updated 2 years ago
- This is an attempt to start documenting the rust sdk for temporal and how to use it following some of the examples in typescript.☆38Jun 16, 2023Updated 2 years ago
- ☆36Aug 13, 2024Updated last year
- Better, container friendly big-data images for Docker☆39Nov 12, 2016Updated 9 years ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆136Oct 25, 2023Updated 2 years ago
- Apache DataFusion Comet Spark Accelerator☆1,150Updated this week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,430Updated this week
- a curated list of awesome lakehouse frameworks, applications, etc☆42Feb 9, 2026Updated last month
- This is a demo project comparing two web scraping frameworks, Playwright and Selenium, using the new pipelining tool Dagster☆15Sep 9, 2021Updated 4 years ago
- Snowflake Data Source for Apache Spark.☆229Updated this week
- Building a data lakehouse with open source technology. Supports an end-to-end data pipeline, from source data on AWS S3 to the lakehouse, visualize a…☆38Dec 15, 2025Updated 2 months ago