facebookincubator / nimble
New file format for storage of large columnar datasets.
☆482Updated this week
Alternatives and similar repositories for nimble:
Users that are interested in nimble are comparing it to the libraries listed below
- BtrBlocks: Efficient Columnar Compression for Data Lakes (SIGMOD 2023 Paper)☆235Updated 9 months ago
- Apache DataFusion Ray☆158Updated this week
- A cross platform way to express data transformation, relational algebra, standardized record expression and plans.☆1,252Updated this week
- An extensible, state-of-the-art columnar file format☆1,112Updated this week
- Distributed SQL Query Engine in Python using Ray☆243Updated 4 months ago
- Apache DataFusion Comet Spark Accelerator☆890Updated this week
- A native Delta implementation for integration with any query engine☆188Updated this week
- Lakekeeper: A Rust native Iceberg REST Catalog☆448Updated this week
- Database connectivity API standard and libraries for Apache Arrow☆407Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆318Updated last year
- Apache Iceberg☆821Updated this week
- CMU-DB's Cascades optimizer framework☆397Updated last month
- ☆215Updated this week
- Embeddable stream processing engine based on Apache DataFusion☆319Updated 2 months ago
- DuckDB for streaming data☆325Updated this week
- ClickBench: a Benchmark For Analytical Databases☆731Updated this week
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆223Updated this week
- A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to …☆190Updated this week
- An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.☆232Updated 4 months ago
- GlareDB: An analytics DBMS for distributed data☆768Updated this week
- Towards a New File Format☆201Updated last week
- Columnstore Table in Postgres☆510Updated last week
- Analytical database for data-driven Web applications 🪶☆476Updated this week
- LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive (AI) workloads.☆634Updated this week
- DuckDB-powered data lake analytics from Postgres☆492Updated this week
- ☆271Updated last week
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆119Updated last month
- DuckDB-powered analytics in Postgres☆152Updated 8 months ago
- Apache DataFusion Python Bindings☆414Updated this week
- Template for DuckDB extensions to help you develop, test and deploy a custom extension☆168Updated this week