A cross-platform way to express data transformation, relational algebra, standardized record expression and plans.
☆1,487 · Mar 18, 2026 · Updated this week
Alternatives and similar repositories for substrait
Users who are interested in substrait are comparing it to the libraries listed below.
- A composable and fully extensible C++ execution engine library for data management systems. ☆4,079 · Updated this week
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines. ☆1,532 · Updated this week
- Ibis Substrait Compiler ☆110 · Updated this week
- ☆100 · Updated this week
- Apache DataFusion SQL Query Engine ☆8,516 · Updated this week
- Apache DataFusion Ballista Distributed Query Engine ☆1,994 · Updated this week
- Apache DataFusion Comet Spark Accelerator ☆1,154 · Updated this week
- Database connectivity API standard and libraries for Apache Arrow ☆574 · Updated this week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics ☆1,439 · Updated this week
- The Auron accelerator for distributed computing framework (e.g., Spark) leverages native vectorized execution to accelerate query process… ☆1,730 · Updated this week
- Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics ☆16,597 · Updated this week
- The portable Python dataframe library ☆6,457 · Updated this week
- New file format for storage of large columnar datasets. ☆700 · Updated this week
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve… ☆6,187 · Updated this week
- Apache Iceberg ☆1,243 · Updated this week
- Rust crate for Substrait: Cross-Language Serialization for Relational Algebra ☆86 · Updated this week
- Helpers for Arrow C Data & Arrow C Stream interfaces ☆226 · Mar 6, 2026 · Updated 2 weeks ago
- CMU-DB's Cascades optimizer framework ☆405 · Jan 6, 2025 · Updated last year
- The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL ☆6,251 · Updated this week
- Transmute-free Rust library to work with the Arrow format ☆1,067 · Feb 27, 2024 · Updated 2 years ago
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations. ☆257 · Feb 21, 2023 · Updated 3 years ago
- Apache DataFusion Python Bindings ☆568 · Updated this week
- Official Rust implementation of Apache Arrow ☆3,403 · Updated this week
- An example Flight SQL Server implementation, with DuckDB and SQLite back-ends. ☆280 · Sep 25, 2024 · Updated last year
- ☆14 · Mar 2, 2026 · Updated 3 weeks ago
- This is the companion repository for the book How Query Engines Work. ☆430 · Jan 25, 2026 · Updated last month
- Apache DataFusion Ray ☆228 · Oct 5, 2025 · Updated 5 months ago
- A native Rust library for Delta Lake, with bindings into Python ☆3,169 · Updated this week
- An extensible, state-of-the-art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Li… ☆2,801 · Updated this week
- Experimental support for serializing DataFusion plans using substrait ☆46 · Jan 13, 2023 · Updated 3 years ago
- Apache Iceberg ☆8,636 · Updated this week
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages. ☆893 · Mar 10, 2026 · Updated last week
- Extensible SQL Lexer and Parser for Rust ☆3,334 · Mar 13, 2026 · Updated last week
- Data Agent Ready Warehouse: one for Analytics, Search, AI, Python Sandbox, rebuilt from scratch. Unified architecture on your S3. ☆9,202 · Updated this week
- Distributed SQL Query Engine in Python using Ray ☆245 · Oct 2, 2024 · Updated last year
- ☆33 · Mar 15, 2026 · Updated last week
- ☆60 · Feb 12, 2026 · Updated last month
- Python SQL Parser and Transpiler ☆9,041 · Updated this week
- Malloy is a modern open source language for describing data relationships and transformations. ☆2,421 · Updated this week