BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)
☆279 · Apr 7, 2025 · Updated 10 months ago
Alternatives and similar repositories for btrblocks
Users interested in btrblocks are comparing it to the libraries listed below
- Next-Gen Big Data File Format ☆660 · Oct 11, 2025 · Updated 4 months ago
- Fast Static Symbol Table (FSST): efficient random-access string compression ☆499 · Nov 26, 2025 · Updated 3 months ago
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics! ☆145 · Feb 11, 2026 · Updated 2 weeks ago
- ☆609 · Updated this week
- New file format for storage of large columnar datasets. ☆693 · Updated this week
- An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Li… ☆2,742 · Updated this week
- Rust implementation of the FastLanes compression library ☆163 · Updated this week
- A User-Defined Function Framework for Apache Arrow. ☆111 · Feb 1, 2026 · Updated last month
- Collection of experiments to carve out the differences between two types of relational query processing engines: Vectorizing (interpretat… ☆267 · Jul 18, 2018 · Updated 7 years ago
- A composable and fully extensible C++ execution engine library for data management systems. ☆4,065 · Updated this week
- ALP: Adaptive Lossless Floating-Point Compression ☆165 · Oct 16, 2025 · Updated 4 months ago
- CMU-DB's Cascades optimizer framework ☆405 · Jan 6, 2025 · Updated last year
- A native storage format for apache arrow ☆82 · Oct 18, 2023 · Updated 2 years ago
- InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution. ☆55 · May 13, 2024 · Updated last year
- Hyrise is a research in-memory database. ☆859 · Updated this week
- An update-in-place key-value store for modern storage. ☆147 · Jan 2, 2024 · Updated 2 years ago
- ☆39 · Jun 20, 2020 · Updated 5 years ago
- Fastest and safest Rust implementation of parquet. `unsafe` free. Integration-tested against pyarrow ☆383 · Jul 31, 2024 · Updated last year
- (Det)erministic deadl(ock) resolution for high-throughput, low-latency, and strongly consistent data stores. ☆28 · May 12, 2025 · Updated 9 months ago
- Code and results for our paper "Analyzing Vectorized Hash Tables Across CPU Architectures" @ VLDB '23. ☆28 · Feb 2, 2024 · Updated 2 years ago
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans. ☆1,476 · Updated this week
- Apache DataFusion Comet Spark Accelerator ☆1,148 · Updated this week
- GlareDB: A light and fast SQL database for analytics ☆1,003 · Nov 14, 2025 · Updated 3 months ago
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve… ☆6,123 · Updated this week
- ☆16 · Jun 11, 2025 · Updated 8 months ago
- C++ fast transactional key-value storage. ☆181 · Updated this week
- ☆75 · Mar 27, 2025 · Updated 11 months ago
- Pure-Rust implementation of Fast Static Symbol Tables string compression ☆208 · Updated this week
- An educational OLAP database system. ☆1,815 · Aug 10, 2025 · Updated 6 months ago
- ☆81 · Sep 9, 2025 · Updated 5 months ago
- Distributed pushdown cache for DataFusion ☆385 · Feb 21, 2026 · Updated last week
- HOT - Height Optimized Trie ☆158 · Mar 26, 2018 · Updated 7 years ago
- ☆50 · Apr 11, 2024 · Updated last year
- Apache DataFusion SQL Query Engine ☆8,462 · Updated this week
- Apache Iceberg C++ ☆192 · Updated this week
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines. ☆1,521 · Updated this week
- AnyBlox runtime and tooling ☆36 · Sep 4, 2025 · Updated 5 months ago
- EuroSys '24: "Trinity: A Fast Compressed Multi-attribute Data Store" ☆19 · Mar 8, 2025 · Updated 11 months ago
- An open sourced implementation of Bw-Tree in SQL Server Hekaton ☆526 · Nov 14, 2018 · Updated 7 years ago