Embucket / embucketLinks
A BYOC option for Snowflake workloads
☆101Updated this week
Alternatives and similar repositories for embucket
Users that are interested in embucket are comparing it to the libraries listed below
Sorting:
- Compaction runtime for Apache Iceberg.☆100Updated this week
- Open, Multi-modal Catalog for Data & AI, written in Rust☆82Updated last year
- The observability platform for Iceberg lakehouses.☆369Updated this week
- A native Delta implementation for integration with any query engine☆271Updated last week
- An in-process Parquet merge engine for better data warehousing in S3 with MVCC☆149Updated 5 months ago
- ☆50Updated 3 months ago
- Fully Managed, Streaming Ingestion (CDC) into your Lakehouse☆222Updated last week
- Arrow Flight SQL Server☆112Updated 4 months ago
- 🚀 GizmoSQL — High-Performance SQL Server☆199Updated last week
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆221Updated last week
- A purely experimental DuckDB Deltalake extension☆95Updated last week
- Apache DataFusion Ray☆221Updated 3 weeks ago
- ☆332Updated this week
- Multi-hop declarative data pipelines☆122Updated this week
- TPC-H benchmark data generation in pure Rust☆202Updated last month
- DuckDB-powered analytics in Postgres☆153Updated last year
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆329Updated 2 years ago
- Iceberg Playground in a Box☆67Updated 4 months ago
- [SIGMOD 2026] F3: The Open-Source Data File Format for the Future☆230Updated 3 weeks ago
- The Airport extension for DuckDB, enables the use of Arrow Flight with DuckDB☆304Updated last week
- Template for DuckDB extensions to help you develop, test and deploy a custom extension☆231Updated 2 weeks ago
- Firebolt Core is a free, self-hosted edition of Firebolt's distributed query engine (https://www.firebolt.io/); it provides high-performa…☆177Updated this week
- Apache DataFusion Benchmarks☆22Updated 3 weeks ago
- In-Memory Analytics for Kafka using DuckDB☆141Updated last week
- Analytical database for data-driven Web applications 🪶☆499Updated 8 months ago
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆139Updated 2 months ago
- Boring Data Tool☆235Updated last year
- A collection of RBIR projects and posts for anyone interested in joining this journey.☆294Updated this week
- The native Rust implementation for Apache Hudi, with C++ & Python API bindings.☆255Updated this week
- Distributed SQL Query Engine in Python using Ray☆246Updated last year