Experimental version. A BYOC option for Snowflake workloads
☆99 · Mar 19, 2026 · Updated this week
Alternatives and similar repositories for embucket-labs
Users who are interested in embucket-labs compare it to the libraries listed below.
- Unofficial Rust implementation of Apache Iceberg with integration for DataFusion ☆233 · Mar 11, 2026 · Updated last week
- An Ibis back-end for the GizmoSQL Arrow Flight SQL Server (with the DuckDB engine) ☆16 · Mar 2, 2026 · Updated 2 weeks ago
- ☆35 · Feb 14, 2026 · Updated last month
- Composable expressions for data pipelines ☆500 · Updated this week
- Holocron is an object storage based leader election library. ☆128 · Oct 1, 2024 · Updated last year
- Rust DataFusion Server ☆25 · Updated this week
- Portable Python ML-powered data bot ☆25 · Sep 27, 2024 · Updated last year
- OCRA: Object-store Cache in Rust for All ☆16 · Sep 29, 2025 · Updated 5 months ago
- Rust crate for Substrait: Cross-Language Serialization for Relational Algebra ☆86 · Mar 12, 2026 · Updated last week
- Parquet extension ☆11 · Mar 3, 2026 · Updated 2 weeks ago
- Generated Kafka protocol implementations ☆35 · Mar 4, 2026 · Updated 2 weeks ago
- LakeSail's computation framework with a mission to unify batch processing, stream processing, and compute-intensive AI workloads. ☆1,186 · Updated this week
- Postgres protocol frontend for DataFusion ☆135 · Updated this week
- A SQL query compiler written in Rust from scratch ☆21 · Sep 21, 2024 · Updated last year
- A cloud native embedded storage engine built on object storage. ☆2,793 · Updated this week
- Pushdown cache for DataFusion ☆390 · Mar 14, 2026 · Updated last week
- ☆29 · Dec 5, 2025 · Updated 3 months ago
- Pure Rust Iceberg Implementation ☆163 · Aug 13, 2024 · Updated last year
- Singer.io Tap for MySQL - PipelineWise compatible ☆18 · Sep 20, 2024 · Updated last year
- ☆33 · May 9, 2025 · Updated 10 months ago
- High throughput streaming of Protobuf data from Kafka into DuckDB ☆12 · Mar 4, 2026 · Updated 2 weeks ago
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust. ☆1,221 · Updated this week
- Infraless Database over any S3 storage API. ☆21 · Mar 23, 2024 · Updated last year
- Arrow-Powered Data Exchange ☆15 · Feb 7, 2025 · Updated last year
- Documentation and resources for deploying JupyterHub on Hadoop ☆19 · Jul 16, 2019 · Updated 6 years ago
- Iceberg Playground in a Box ☆67 · Jun 27, 2025 · Updated 8 months ago
- ☆67 · May 9, 2025 · Updated 10 months ago
- QTag: Turbocharge Your SQL Comments ☆12 · Jan 30, 2025 · Updated last year
- Zap file format compatible with a future version of Bleve ☆14 · Updated this week
- Template-based generation of DAG cards from Metaflow classes, inspired by Google cards for machine learning models. ☆30 · Dec 7, 2021 · Updated 4 years ago
- Run, mock and test fake Snowflake databases locally. ☆179 · Mar 1, 2026 · Updated 2 weeks ago
- Proxy for S3 ☆18 · Feb 13, 2026 · Updated last month
- Trino Iceberg Metadata Insights via Streamlit ☆16 · Apr 9, 2025 · Updated 11 months ago
- Real-time data processing/feature engineering in Rust, Python and SQL. Tailored for modern AI/ML systems. ☆76 · Feb 10, 2026 · Updated last month
- An Almost Exactly Once Delivery (AEOD) queue ☆83 · Feb 25, 2026 · Updated 3 weeks ago
- DuckDB API Server with Arrow Flight SQL Airport support and concurrent writes/reads (quackpipe) ☆121 · Mar 5, 2025 · Updated last year
- Apache Paimon Rust: the Rust implementation of Apache Paimon. ☆150 · Updated this week
- ☆42 · Apr 14, 2025 · Updated 11 months ago
- Simple Workflow Framework - Hamilton + Task Queue (RQ or APScheduler) = FlowerPower ☆23 · Nov 12, 2025 · Updated 4 months ago