nimtable / iceberg-compactionLinks
Compaction runtime for Apache Iceberg.
☆113Updated this week
Alternatives and similar repositories for iceberg-compaction
Users that are interested in iceberg-compaction are comparing it to the libraries listed below
Sorting:
- Experimental version. A BYOC option for Snowflake workloads☆100Updated last week
- Fully Managed, Streaming Ingestion (CDC) into your Lakehouse☆292Updated this week
- TPC-H benchmark data generation in pure Rust☆218Updated this week
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆231Updated this week
- Apache DataFusion Ray☆228Updated 3 months ago
- A native Delta implementation for integration with any query engine☆305Updated this week
- ☆33Updated 8 months ago
- Pure Rust Iceberg Implementation☆162Updated last year
- Postgres protocol frontend for DataFusion☆122Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust☆85Updated last year
- DataFusion TableProviders for reading data from other systems☆162Updated this week
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆266Updated this week
- Apache Parquet Testing☆79Updated last month
- ☆357Updated last week
- Message queue and data streaming based on cloud native services.☆115Updated last month
- Arrow Flight SQL Server☆122Updated 6 months ago
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆306Updated this week
- Allow DataFusion to resolve queries across remote query engines while pushing down as much compute as possible down.☆162Updated last month
- Batteries included CLI, TUI, and server implementations for DataFusion.☆185Updated last month
- ☆86Updated 8 months ago
- The observability platform for Iceberg lakehouses.☆410Updated 2 weeks ago
- JSON support for DataFusion (unofficial)☆53Updated 3 weeks ago
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆145Updated 4 months ago
- This repository is made as read-only filesystem for remote access.☆123Updated last week
- Distributed pushdown cache for DataFusion☆357Updated last week
- Distributed SQL Query Engine in Python using Ray☆246Updated last year
- Embeddable Aggregate Management System for Streams and Queries.☆105Updated 2 months ago
- In-Memory Analytics for Kafka using DuckDB☆146Updated this week
- Rust based high-performance Apache Uniffle shuffle-server☆55Updated this week
- AnyBlob - A Universal Cloud Object Storage Download Manager Built For Cost-Throughput Optimal Analytics!☆144Updated last week