delta-io / kafka-delta-ingest
A highly efficient daemon for streaming data from Kafka into Delta Lake
☆366Updated last week
Related projects ⓘ
Alternatives and complementary repositories for kafka-delta-ingest
- Lakekeeper: A Rust native Iceberg REST Catalog☆217Updated this week
- A native Delta implementation for integration with any query engine☆143Updated this week
- Apache DataFusion Comet Spark Accelerator☆816Updated this week
- Apache PyIceberg☆461Updated this week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,031Updated this week
- Replicates any database (CDC events) to Apache Iceberg (To Cloud Storage)☆191Updated this week
- Performance Observability for Apache Spark☆192Updated this week
- A Python Library to support running data quality rules while the spark job is running⚡☆162Updated this week
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆342Updated 5 months ago
- ☆252Updated 2 weeks ago
- dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks☆400Updated this week
- Adapter for dbt that executes dbt pipelines on Apache Flink☆83Updated 7 months ago
- A library that provides useful extensions to Apache Spark and PySpark.☆195Updated this week
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆213Updated this week
- An open protocol for secure data sharing☆769Updated this week
- Open Control Plane for Tables in Data Lakehouse☆306Updated this week
- Delta Lake helper methods in PySpark☆304Updated 2 months ago
- Spark style guide☆257Updated last month
- Apache Spark Connect Client for Rust☆90Updated last week
- Open, Multi-modal Catalog for Data & AI, written in Rust☆74Updated last month
- ☆150Updated 3 weeks ago
- Snowflake Data Source for Apache Spark.☆216Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆303Updated last year
- Turning PySpark Into a Universal DataFrame API☆317Updated this week
- Apache Polaris, the interoperable, open source catalog for Apache Iceberg☆1,129Updated this week
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆187Updated last week