A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
☆1,476Feb 25, 2026Updated this week
Alternatives and similar repositories for substrait
Users that are interested in substrait are comparing it to the libraries listed below
Sorting:
- A composable and fully extensible C++ execution engine library for data management systems.☆4,065Updated this week
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,521Updated this week
- Apache DataFusion SQL Query Engine☆8,462Updated this week
- Ibis Substrait Compiler☆109Updated this week
- Apache DataFusion Ballista Distributed Query Engine☆1,984Updated this week
- ☆97Updated this week
- Database connectivity API standard and libraries for Apache Arrow☆556Updated this week
- Apache DataFusion Comet Spark Accelerator☆1,148Updated this week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,425Updated this week
- The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process…☆1,715Updated this week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics☆16,543Updated this week
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,123Updated this week
- Apache Iceberg☆1,229Updated this week
- the portable Python dataframe library☆6,417Updated this week
- New file format for storage of large columnar datasets.☆693Updated this week
- The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL☆6,239Updated this week
- Transmute-free Rust library to work with the Arrow format☆1,069Feb 27, 2024Updated 2 years ago
- Helpers for Arrow C Data & Arrow C Stream interfaces☆225Updated this week
- Rust crate for Substrait: Cross-Language Serialization for Relational Algebra☆84Updated this week
- CMU-DB's Cascades optimizer framework☆405Jan 6, 2025Updated last year
- A native Rust library for Delta Lake, with bindings into Python☆3,156Updated this week
- Apache DataFusion Python Bindings☆564Updated this week
- Apache Iceberg☆8,572Updated this week
- An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Li…☆2,742Updated this week
- Official Rust implementation of Apache Arrow☆3,379Updated this week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆280Sep 25, 2024Updated last year
- Malloy is a modern open source language for describing data relationships and transformations.☆2,405Updated this week
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆889Feb 9, 2026Updated 3 weeks ago
- Extensible SQL Lexer and Parser for Rust☆3,320Updated this week
- This is the companion repository for the book How Query Engines Work.☆427Jan 25, 2026Updated last month
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆257Feb 21, 2023Updated 3 years ago
- Data Agent Ready Warehouse : One for Analytics, Search, AI, Python Sandbox. — rebuilt from scratch. Unified architecture on your S3.☆9,161Updated this week
- Python SQL Parser and Transpiler☆8,965Updated this week
- A modular implementation of timely dataflow in Rust☆3,583Updated this week
- Apache DataFusion Ray☆228Oct 5, 2025Updated 4 months ago
- Apache Calcite☆5,077Updated this week
- Distributed SQL Query Engine in Python using Ray☆245Oct 2, 2024Updated last year
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,608Updated this week
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆303Oct 30, 2025Updated 4 months ago