tikal-fuseday/delta-architecture

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tikal-fuseday/delta-architecture)

tikal-fuseday / delta-architecture

Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline

☆77

Alternatives and similar repositories for delta-architecture

Users that are interested in delta-architecture are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

databrickslabs / delta-oms
View on GitHub
DeltaOMS is a solution that help build a centralized repository of Delta Transaction logs and associated operational metrics/statistics f…
☆42Nov 27, 2023Updated 2 years ago
bartosz25 / acid-file-formats
View on GitHub
Code for Apache Hudi, Apache Iceberg and Delta Lake analysis
☆10Feb 2, 2024Updated 2 years ago
projectnessie / nessie-demos
View on GitHub
Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.
☆32Updated this week
ing-bank / rokku
View on GitHub
Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…
☆72Aug 27, 2025Updated 11 months ago
aws-samples / kda-flink-app-autoscaling
View on GitHub
This repo demonstrates how to use AWS application auto-scaling to implement custom-scaling in your Kinesis Data Analytics for Apache Flin…
☆19Feb 21, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
AbsaOSS / spark-hofs
View on GitHub
Scala API for Apache Spark SQL high-order functions
☆15Aug 4, 2023Updated 2 years ago
sparsecode / DaFlow
View on GitHub
Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…
☆26Jun 7, 2021Updated 5 years ago
bitsondatadev / hive-metastore
View on GitHub
☆25Mar 15, 2024Updated 2 years ago
elastacloud / parquet-usql
View on GitHub
A custom extractor designed to read parquet for Azure Data Lake Analytics
☆13Feb 13, 2018Updated 8 years ago
delta-io / kafka-delta-ingest
View on GitHub
A highly efficient daemon for streaming data from Kafka into Delta Lake
☆440Jun 22, 2026Updated last month
raashidsalih / churn-pipeline
View on GitHub
A custom end-to-end analytics platform for customer churn
☆10May 15, 2025Updated last year
jaceklaskowski / spark-delta-lake-workshop
View on GitHub
Spark and Delta Lake Workshop
☆22Jun 14, 2022Updated 4 years ago
AbsaOSS / atum
View on GitHub
A dynamic data completeness and accuracy library at enterprise scale for Apache Spark
☆30May 13, 2026Updated 2 months ago
xuqinghan / nginx-gunicorn-flask
View on GitHub
Dockerfile for Nginx + Gunicorn + Flask
☆12Dec 24, 2017Updated 8 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
dmatrix / feast_workshops
View on GitHub
A series of workshop modules introducing Feast feature store.
☆18May 31, 2022Updated 4 years ago
qubole / streaminglens
View on GitHub
Qubole Streaminglens tool for tuning Spark Structured Streaming Pipelines
☆17Jan 21, 2020Updated 6 years ago
AstroPlant / astroplant-server
View on GitHub
This project is being deprecated, see https://github.com/astroplant/astroplant-api
☆11Mar 12, 2019Updated 7 years ago
melvic-ybanez / lohika
View on GitHub
A Proof Generator for Entailments, Tautologies, and Semantic Equivalences in First-order Logic
☆44Jan 5, 2026Updated 6 months ago
ysfesr / Building-Data-LakeHouse
View on GitHub
Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data
☆51Dec 2, 2023Updated 2 years ago
airbnb / sputnik
View on GitHub
☆64Nov 8, 2019Updated 6 years ago
bartosz25 / spark-scala-playground
View on GitHub
Sample processing code using Spark 2.1+ and Scala
☆51Jun 28, 2020Updated 6 years ago
japila-books / delta-lake-internals
View on GitHub
The Internals of Delta Lake
☆186Jun 18, 2026Updated last month
quiltdata / examples
View on GitHub
☆12Oct 24, 2025Updated 9 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
jeppe742 / DeltaLakeReader
View on GitHub
Read Delta tables without any Spark
☆47Mar 8, 2024Updated 2 years ago
busbud / pongdome
View on GitHub
Only if you take ping pong seriously. Seriously.
☆14Dec 4, 2022Updated 3 years ago
toolboc / azure-iot-edge-bogus-data-generator
View on GitHub
An IoT Edge Module that generates sample data using [Bogus](https://github.com/bchavez/Bogus)
☆10Dec 8, 2022Updated 3 years ago
arrikto / learn-kubeflow
View on GitHub
Learn Kubeflow with Arrikto
☆15Jan 4, 2022Updated 4 years ago
AbsaOSS / hyperdrive
View on GitHub
Extensible streaming ingestion pipeline on top of Apache Spark
☆47Jul 17, 2025Updated last year
dosvath / kerberos-containers
View on GitHub
Containers for the Kerberos with SSH tutorial.
☆17Dec 19, 2023Updated 2 years ago
sjwiesman / flink-scala-3
View on GitHub
☆36Aug 24, 2022Updated 3 years ago
hablapps / scalacrashcourse
View on GitHub
Crash course in Scala
☆22Apr 14, 2020Updated 6 years ago
ngocquang / logging_system
View on GitHub
Hướng dẫn tạo một hệ thống Log Remote dùng chung cho nhiều dự án/server
☆15Feb 26, 2020Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
databrickslabs / feature-factory
View on GitHub
Accelerator to rapidly deploy customized features for your business
☆57Dec 10, 2023Updated 2 years ago
lloydmeta / sparkka-streams
View on GitHub
Power a Spark Stream from anywhere in your Akka Stream Flow
☆12Mar 1, 2016Updated 10 years ago
SibeeshVenu / Realtime-IoT-Device-Data-using-Azure-SignalR-and-Azure-Function-in-Angular
View on GitHub
We will see how we can show the real-time data from our IoT device in an Angular application using Azure SignalR service and Azure Functi…
☆13Jan 3, 2019Updated 7 years ago
leehuwuj / olh
View on GitHub
Open source stack lakehouse
☆25Mar 2, 2024Updated 2 years ago
hurtn / databricks
View on GitHub
☆12Aug 6, 2020Updated 5 years ago
sourav-mazumder / Data-Science-Extensions
View on GitHub
☆70Mar 15, 2021Updated 5 years ago
intenthq / gander
View on GitHub
Html Content / Article Extractor in Scala
☆18May 23, 2018Updated 8 years ago