BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)
☆280Apr 7, 2025Updated 11 months ago
Alternatives and similar repositories for btrblocks
Users that are interested in btrblocks are comparing it to the libraries listed below
Sorting:
- Next-Gen Big Data File Format☆666Oct 11, 2025Updated 5 months ago
- Fast Static Symbol Table (FSST): efficient random-access string compression☆502Nov 26, 2025Updated 3 months ago
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆147Feb 11, 2026Updated last month
- An extensible, state of the art columnar file format. Formerly at @spiraldb, now an Incubation Stage project at LFAI&Data, part of the Li…☆2,801Updated this week
- New file format for storage of large columnar datasets.☆700Updated this week
- Rust implementation of the FastLanes compression library☆164Mar 12, 2026Updated last week
- ☆613Mar 6, 2026Updated 2 weeks ago
- Collection of experiments to carve out the differences between two types of relational query processing engines: Vectorizing (interpretat…☆267Jul 18, 2018Updated 7 years ago
- An interactive tool for exploring AWS EC2 Instances in the browser. Powered by DuckDB-WASM☆24Jan 14, 2026Updated 2 months ago
- A native storage format for apache arrow☆83Oct 18, 2023Updated 2 years ago
- InkFuse - An Experimental Database Runtime Unifying Vectorized and Compiled Query Execution.☆55May 13, 2024Updated last year
- A User-Defined Function Framework for Apache Arrow.☆112Feb 1, 2026Updated last month
- A composable and fully extensible C++ execution engine library for data management systems.☆4,079Updated this week
- ALP floating point compression in Rust☆50Mar 16, 2026Updated last week
- CMU-DB's Cascades optimizer framework☆405Jan 6, 2025Updated last year
- ALP: Adaptive Lossless Floating-Point Compression☆168Oct 16, 2025Updated 5 months ago
- Code and results for our paper "Analyzing Vectorized Hash Tables Across CPU Architectures" @ VLDB '23.☆28Feb 2, 2024Updated 2 years ago
- SQLStorm: Taking Database Benchmarking into the LLM Era☆78Jan 2, 2026Updated 2 months ago
- AnyBlox runtime and tooling☆36Sep 4, 2025Updated 6 months ago
- Fastest and safest Rust implementation of parquet. `unsafe` free. Integration-tested against pyarrow☆383Jul 31, 2024Updated last year
- Hyrise is a research in-memory database.☆859Updated this week
- C++ library to pack and unpack vectors of integers having a small range of values using a technique called Frame of Reference☆54Feb 19, 2024Updated 2 years ago
- ☆39Jun 20, 2020Updated 5 years ago
- Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data ve…☆6,187Updated this week
- An update-in-place key-value store for modern storage.☆147Jan 2, 2024Updated 2 years ago
- TPC-H benchmark data generation in pure Rust☆233Mar 11, 2026Updated last week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,487Updated this week
- Axiom is a set of reusable and extensible components designed to be compatible with Velox. Its primary purpose is to simplify the process…☆62Updated this week
- Pure-Rust implementation of Fast Static Symbol Tables string compression☆209Updated this week
- An educational OLAP database system.☆1,822Aug 10, 2025Updated 7 months ago
- Apache DataFusion Comet Spark Accelerator☆1,154Updated this week
- GlareDB: A light and fast SQL database for analytics☆1,004Nov 14, 2025Updated 4 months ago
- BI benchmark with user generated data and queries☆73Dec 19, 2024Updated last year
- (Det)erministic deadl(ock) resolution for high-throughput, low-latency, and strongly consistent data stores.☆28May 12, 2025Updated 10 months ago
- LingoDB: A new analytical database system that blurs the lines between databases and compilers.☆297Updated this week
- Apache DataFusion SQL Query Engine☆8,516Updated this week
- ☆75Mar 27, 2025Updated 11 months ago
- Rust implementation of Apache ORC☆30Mar 3, 2026Updated 2 weeks ago
- Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.☆1,532Updated this week