delta-io/kafka-delta-ingest

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/delta-io/kafka-delta-ingest)

delta-io / kafka-delta-ingest

A highly efficient daemon for streaming data from Kafka into Delta Lake

☆440

Alternatives and similar repositories for kafka-delta-ingest

Users that are interested in kafka-delta-ingest are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

delta-io / delta-rs
View on GitHub
A native Rust library for Delta Lake, with bindings into Python
☆3,274Updated this week
delta-io / delta-sharing
View on GitHub
An open protocol for secure data sharing
☆953Updated this week
delta-io / delta-kernel-rs
View on GitHub
A native Delta implementation for integration with any query engine
☆353Updated this week
apache / datafusion-comet
View on GitHub
Apache DataFusion Comet Spark Accelerator
☆1,235Updated this week
delta-incubator / deltaray
View on GitHub
Delta reader for the Ray open-source toolkit for building ML applications
☆46Jan 27, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
delta-io / delta
View on GitHub
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…
☆8,924Updated this week
apache / datafusion-ballista
View on GitHub
Apache DataFusion Ballista Distributed Query Engine
☆2,096Updated this week
apache / iceberg-rust
View on GitHub
Apache Iceberg
☆1,363Updated this week
MrPowers / mack
View on GitHub
Delta Lake helper methods in PySpark
☆328Jan 19, 2026Updated 6 months ago
databricks / iceberg-kafka-connect
View on GitHub
☆284Jul 3, 2025Updated last year
unitycatalog / unitycatalog-rs
View on GitHub
Open, Multi-modal Catalog for Data & AI, written in Rust
☆86Sep 30, 2024Updated last year
rivian / delta-go
View on GitHub
☆94May 5, 2025Updated last year
apache / datafusion
View on GitHub
Apache DataFusion SQL Query Engine
☆9,044Updated this week
delta-incubator / delta-sharing-rs
View on GitHub
A Minimalistic Rust Implementation of Delta Sharing Server.
☆97Mar 17, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
voltrondata / superset-sqlalchemy-adbc-flight-sql-poc
View on GitHub
A proof-of-concept repo that attempts to use Apache Superset with a custom ADBC to Arrow Flight SQL SQLAlchemy driver.
☆25Sep 8, 2023Updated 2 years ago
sjrusso8 / spark-connect-rs
View on GitHub
Apache Spark Connect Client for Rust
☆116Jun 10, 2025Updated last year
databrickslabs / delta-oms
View on GitHub
DeltaOMS is a solution that help build a centralized repository of Delta Transaction logs and associated operational metrics/statistics f…
☆42Nov 27, 2023Updated 2 years ago
dbt-labs / dbt-spark
View on GitHub
This repository has moved into https://github.com/dbt-labs/dbt-adapters
☆447Jul 16, 2025Updated last year
unitycatalog / unitycatalog
View on GitHub
Open, Multi-modal Catalog for Data & AI
☆3,472Updated this week
lakekeeper / lakekeeper
View on GitHub
Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.
☆1,400Updated this week
ArroyoSystems / arroyo
View on GitHub
Distributed stream processing engine in Rust
☆4,976Updated this week
delta-io / delta-docs
View on GitHub
Delta Lake Documentation
☆54Jun 19, 2024Updated 2 years ago
timvw / arrow-flightsql-odbc
View on GitHub
☆14Feb 10, 2026Updated 5 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
projectnessie / nessie
View on GitHub
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
☆1,484Updated this week
delta-incubator / delta-dotnet
View on GitHub
DeltaLake bindings for dotnet based on delta-rs
☆68Updated this week
danielbeach / lakescum
View on GitHub
A Python package to help Databricks Unity Catalog users to read and query Delta Lake tables with Polars, DuckDb, or PyArrow.
☆27Mar 25, 2024Updated 2 years ago
estuary / flow
View on GitHub
🌊 Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, by managing your data flows wi…
☆957Updated this week
JanKaul / iceberg-rust
View on GitHub
Unofficial rust implementation of Apache Iceberg with integration for Datafusion
☆241Updated this week
turbolytics / sql-flow
View on GitHub
DuckDB for streaming data
☆779Sep 4, 2025Updated 10 months ago
projectnessie / nessie-demos
View on GitHub
Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.
☆32Updated this week
tikal-fuseday / delta-architecture
View on GitHub
Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline
☆77Feb 15, 2023Updated 3 years ago
Mause / duckdb-deltatable-extension
View on GitHub
A purely experimental DuckDB Deltalake extension
☆94Jul 20, 2026Updated last week
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
OpenLineage / OpenLineage
View on GitHub
An Open Standard for lineage metadata collection
☆2,568Updated this week
returnString / convergence
View on GitHub
A set of tools for writing servers that speak PostgreSQL's wire protocol
☆93Feb 11, 2026Updated 5 months ago
databrickslabs / dbx
View on GitHub
🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.
☆463Mar 27, 2026Updated 4 months ago
delta-io / delta-examples
View on GitHub
Delta Lake examples
☆241Oct 8, 2024Updated last year
memiiso / debezium-server-iceberg
View on GitHub
Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake
☆324Updated this week
Eventual-Inc / Daft
View on GitHub
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale
☆5,669Updated this week
icelake-io / icelake
View on GitHub
Pure Rust Iceberg Implementation
☆162Aug 13, 2024Updated last year