facebookincubator / nimble
New file format for storage of large columnar datasets.
☆567 · Updated this week
Alternatives and similar repositories for nimble
Users who are interested in nimble compare it to the libraries listed below.
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper) ☆247 · Updated 3 months ago
- Apache DataFusion Ray ☆207 · Updated 3 months ago
- Apache DataFusion Comet Spark Accelerator ☆988 · Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans. ☆1,355 · Updated 2 weeks ago
- An extensible, state of the art columnar file format. Formerly at @spiraldb, now a Linux Foundation project. ☆1,329 · Updated this week
- Distributed SQL Query Engine in Python using Ray ☆243 · Updated 9 months ago
- Next-Gen Big Data File Format ☆241 · Updated 3 weeks ago
- A native Delta implementation for integration with any query engine ☆236 · Updated this week
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust. ☆768 · Updated this week
- ☆291 · Updated this week
- A collection of RBIR projects and posts for anyone interested in joining this journey. ☆259 · Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg ☆329 · Updated 2 years ago
- The Control Plane for Apache Iceberg. ☆278 · Updated this week
- Apache Iceberg ☆1,008 · Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to … ☆228 · Updated this week
- Database connectivity API standard and libraries for Apache Arrow ☆466 · Updated this week
- CMU-DB's Cascades optimizer framework ☆401 · Updated 6 months ago
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion ☆204 · Updated this week
- Embeddable stream processing engine based on Apache DataFusion ☆343 · Updated 6 months ago
- GlareDB: A light and fast SQL database for analytics ☆928 · Updated last week
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings. ☆230 · Updated this week
- This is the companion repository for the book How Query Engines Work. ☆393 · Updated 2 years ago
- LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive AI workloads. ☆824 · Updated this week
- A BYOC option for Snowflake workloads ☆80 · Updated this week
- Distributed pushdown cache for DataFusion ☆189 · Updated this week
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics! ☆130 · Updated 2 weeks ago
- Template for DuckDB extensions to help you develop, test and deploy a custom extension ☆206 · Updated 3 weeks ago
- DuckDB extension for Delta Lake ☆193 · Updated this week
- TPC-H benchmark data generation in pure Rust ☆110 · Updated this week
- Apache Iceberg C++ ☆96 · Updated this week