substrait-io / substrait
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
☆1,252Updated this week
Alternatives and similar repositories for substrait:
Users that are interested in substrait are comparing it to the libraries listed below
- Apache DataFusion Comet Spark Accelerator☆890Updated this week
- Apache DataFusion Ballista Distributed Query Engine☆1,664Updated this week
- Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.☆1,400Updated this week
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,265Updated this week
- New file format for storage of large columnar datasets.☆482Updated this week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,129Updated this week
- Apache Iceberg☆821Updated this week
- Database connectivity API standard and libraries for Apache Arrow☆407Updated this week
- Lakekeeper: A Rust native Iceberg REST Catalog☆448Updated this week
- A composable and fully extensible C++ execution engine library for data management systems.☆3,615Updated this week
- Apache DataFusion Python Bindings☆414Updated this week
- Apache PyIceberg☆596Updated this week
- Distributed SQL Query Engine in Python using Ray☆243Updated 4 months ago
- A native Rust library for Delta Lake, with bindings into Python☆2,570Updated this week
- Apache DataFusion SQL Query Engine☆6,783Updated this week
- A native Delta implementation for integration with any query engine☆188Updated this week
- ClickBench: a Benchmark For Analytical Databases☆731Updated this week
- This is the companion repository for the book How Query Engines Work.☆384Updated last year
- Apache Polaris, the interoperable, open source catalog for Apache Iceberg☆1,346Updated this week
- Making data lake work for time series☆1,152Updated 6 months ago
- A highly efficient daemon for streaming data from Kafka into Delta Lake☆390Updated 3 weeks ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆822Updated last week
- Transmute-free Rust library to work with the Arrow format☆1,061Updated 11 months ago
- An extensible, state-of-the-art columnar file format☆1,112Updated this week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆232Updated 4 months ago
- GlareDB: An analytics DBMS for distributed data☆768Updated this week
- Vectorized processing for Apache Arrow☆484Updated 3 years ago
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆318Updated last year
- Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.☆257Updated 2 years ago
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆235Updated 9 months ago