voltrondata / spark-substrait-gateway
Implements a gateway that speaks the SparkConnect protocol and drives a backend using Substrait (over ADBC Flight SQL).
☆16Updated this week
Related projects: ⓘ
- A Spark Connector that reads data from / writes data to Arrow-Flight end-points with Arrow-Flight and Flight-SQL☆37Updated last year
- A native Delta implementation for integration with any query engine☆114Updated this week
- ☆131Updated last month
- Gluten: Plugin to Boost Trino's Performance☆69Updated 10 months ago
- ☆75Updated 2 weeks ago
- LST-Bench is a framework that allows users to run benchmarks specifically designed for evaluating Log-Structured Tables (LSTs) such as De…☆64Updated last week
- Simple project to expose a catalog over REST using a Java catalog backend☆103Updated this week
- TPC-DS benchmark kit with some modifications/fixes☆85Updated last month
- Distributed SQL Query Engine in Python using Ray☆230Updated 9 months ago
- A library that provides useful extensions to Apache Spark and PySpark.☆193Updated this week
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆51Updated this week
- A native Rust library for Apache Hudi, with bindings into Python☆137Updated this week
- Storage connector for Trino☆90Updated 2 weeks ago
- An Extensible Data Skipping Framework☆42Updated last year
- Apache Spark Kubernetes Operator☆48Updated this week
- Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are in…☆82Updated 5 months ago
- ☆232Updated this week
- ☆104Updated last year
- Performance Observability for Apache Spark☆163Updated last week
- A Rust implementation of the Iceberg REST Catalog specification.☆147Updated this week
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆111Updated last month
- A S3 Shuffle plugin for Apache Spark to enable elastic scaling for generic Spark workloads.☆37Updated 4 months ago
- QTag: Turbocharge Your SQL Comments☆12Updated last month
- ☆197Updated last month
- Ibis Substrait Compiler☆92Updated this week
- Multi-hop declarative data pipelines☆86Updated last month
- Lakehouse storage system benchmark☆64Updated last year
- Apache DataFusion Comet Spark Accelerator☆748Updated this week
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Updated last year
- Open, Multi-modal Catalog for Data & AI, written in Rust☆72Updated 2 months ago