andygrove / how-query-engines-work
This is the companion repository for the book How Query Engines Work.
☆388Updated 2 years ago
Alternatives and similar repositories for how-query-engines-work:
Users that are interested in how-query-engines-work are comparing it to the libraries listed below
- CMU-DB's Cascades optimizer framework☆397Updated 3 months ago
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆238Updated this week
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆241Updated 3 weeks ago
- Apache DataFusion Comet Spark Accelerator☆939Updated this week
- Distributed SQL Query Engine in Python using Ray☆243Updated 7 months ago
- New file format for storage of large columnar datasets.☆538Updated this week
- Apache DataFusion Ray☆188Updated last month
- Pure Rust Iceberg Implementation☆163Updated 8 months ago
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,302Updated this week
- Apache Iceberg☆912Updated this week
- Apache Paimon Rust The rust implementation of Apache Paimon.☆122Updated 2 weeks ago
- A collection of demos showcasing how stream processing can be used to solve real-world problems.☆191Updated last week
- Rust implementation of Apache Iceberg with integration for Datafusion☆170Updated this week
- A native Delta implementation for integration with any query engine☆226Updated this week
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆125Updated last week
- Streaming and Incremental Computation Framework☆235Updated last year
- Apache DataFusion Ballista Distributed Query Engine☆1,734Updated this week
- Readings in Stream Processing☆122Updated 5 months ago
- The native Rust implementation for Apache Hudi, with Python API bindings.☆211Updated last week
- Sqllogictest (dialect with extensions) parser and runner in Rust.☆190Updated this week
- TPC-H benchmark data generation in pure Rust☆63Updated this week
- 10x lower latency for cloud-native DataFusion☆140Updated this week
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.☆608Updated this week
- Apache DataFusion Python Bindings☆448Updated this week
- Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.☆1,459Updated this week
- Boring Data Tool☆218Updated last year
- Quickly view your data☆307Updated last month
- A list papers of learning how to building database system☆217Updated 6 months ago
- Experimenting with persistence in C☆182Updated 3 years ago
- View parquet files online☆151Updated this week