LearningJournal/Apache-Spark-and-Databricks-Stream-Processing-in-Lakehouse

☆66

Alternatives and similar repositories for Apache-Spark-and-Databricks-Stream-Processing-in-Lakehouse

Users interested in Apache-Spark-and-Databricks-Stream-Processing-in-Lakehouse are comparing it to the repositories listed below.

LearningJournal / Spark-Streaming-In-Python
Apache Spark 3 - Structured Streaming Course Material
☆125 · Aug 19, 2023 · Updated 2 years ago
rockthejvm / udemy-spark-streaming
For Udemy students: the official repository of Rock the JVM's Spark Streaming course
☆26 · Jan 5, 2023 · Updated 3 years ago
airscholar / Kubernetes-For-DataEngineering
This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…
☆25 · Jan 26, 2024 · Updated 2 years ago
CodyAustinDavis / edw-best-practices
Git Repo for EDW Best Practice Assets on the Lakehouse
☆16 · Dec 11, 2023 · Updated 2 years ago
TybulOnAzure / DP-203
☆88 · Mar 26, 2025 · Updated last year
cloudboxacademy / covid19
Resources for the Udemy Course - Azure Data Factory For Data Engineers - Project on Covid19 by Ramesh Retnasamy
☆275 · Feb 10, 2024 · Updated 2 years ago
Hamagistral / Azure-AW
🔧 Azure Data Engineering Project (On-premise db to the cloud)
☆20 · Mar 30, 2024 · Updated 2 years ago
awslabs / cloudformation-ldaps-haproxy-template
Configure an LDAPS Endpoint for Simple AD
☆14 · Aug 29, 2017 · Updated 8 years ago
Monarene / data-engineering-notes
This repository shows my personal notes taken while doing the Udacity Data engineering Nanodegree
☆13 · May 28, 2020 · Updated 6 years ago
LinkedInLearning / end-to-end-data-engineering-project-4413618
This repo is for the Linkedin Learning course: End-to-End Data Engineering Project
☆36 · Nov 9, 2023 · Updated 2 years ago
NetEase / lakehouse-benchmark
A benchmark tool for lakehouses.
☆14 · Mar 12, 2023 · Updated 3 years ago
shashank-mishra219 / Confluent-Kafka-Setup
☆14 · Oct 1, 2022 · Updated 3 years ago
rockthejvm / spark-performance-tuning
The official repository for the Rock the JVM Spark Optimization 2 course
☆45 · Jun 20, 2026 · Updated last month
fbraza / FruitDetect
A deep learning app to detect fruit on camera
☆12 · Nov 12, 2021 · Updated 4 years ago
PacktPublishing / Hands-on-Serverless-computing-with-Go
Hands-on Serverless computing with Go [video], published by Packt
☆14 · Oct 28, 2022 · Updated 3 years ago
lsclovecode / Real-Time-Stock-Streaming-Pipeline
☆17 · Feb 3, 2018 · Updated 8 years ago
GireeshS22 / TimeDistributed-CRNN
To try CTC in Keras
☆19 · Apr 8, 2019 · Updated 7 years ago
paiml / testing-in-python
Examples for the Testing In Python Book
☆13 · Mar 26, 2026 · Updated 3 months ago
faizanahemad / data-science
Personal Repository of Data Science Projects
☆14 · May 8, 2019 · Updated 7 years ago
mdsohelmahmood / stock-price-predict
☆11 · Feb 7, 2021 · Updated 5 years ago
ScholaNest / Spark-Programming-In-Python
Apache Spark 3 - Spark Programming in Python for Beginners
☆515 · Jul 25, 2024 · Updated last year
Ahmeduddin3403 / data-engineering-study-material
Comprehensive study materials covering core data engineering concepts, tools, and practices.
☆17 · Jan 20, 2026 · Updated 6 months ago
benniehaelen / delta-lake-up-and-running
Companion repository for the book 'Delta Lake Up and Running'
☆50 · Apr 5, 2025 · Updated last year
PacktPublishing / DP-203-Azure-Data-Engineer-Associate-Certification-Guide
Azure Data Engineer Associate Certification Guide, published by Packt
☆80 · Apr 22, 2026 · Updated 3 months ago
jodb / DatabricksAndAzureMapsWorkshop
Repository for Databricks And Azure Maps Online Workshop Series
☆17 · Mar 21, 2022 · Updated 4 years ago
padmapria / RAG-Care-Mental-Wellness-Assistant
End to End RAG LLM AI Assistant using LangChain, Llama3, Gemma2, OpenAI, FlaskAPI, Grafana
☆11 · Nov 24, 2025 · Updated 8 months ago
dogukannulu / glue_etl_job_data_catalog_s3
Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog
☆13 · Aug 26, 2023 · Updated 2 years ago
stacktonic-com / stacktonic-dbt-example-project
This dbt starter project template is using the Google Analytics 4 BigQuery exports as input for some practical examples / models to showc…
☆41 · Feb 9, 2026 · Updated 5 months ago
LearningJournal / Spark-Streaming-In-Scala
Apache Spark 3 - Structured Streaming Course Material
☆46 · Sep 8, 2020 · Updated 5 years ago
rockthejvm / spark-streaming
The official repository for the Rock the JVM Spark Streaming course
☆20 · Jun 3, 2026 · Updated last month
MINAADELMARKOS / Advanced_SQL_Queries
☆21 · May 17, 2025 · Updated last year
sahandkhoshdel99 / Computer-Networks
Includes Final Project (Python), Wireshark Labs, and Theoretical HWs
☆13 · Sep 27, 2021 · Updated 4 years ago
rockthejvm / spark-optimization
The official repository for the Rock the JVM Spark Optimization with Scala course
☆58 · Jun 20, 2026 · Updated last month
szymonzaczek / databricks-ci-cd
Databricks CI/CD using Azure DevOps
☆21 · Nov 1, 2022 · Updated 3 years ago
wssbck / training-oreilly-iceberg
companion repository for an Apache Iceberg video course
☆15 · Mar 5, 2025 · Updated last year
dbsys21 / databricks-lakehouse
This repo contains live examples to build Databricks' Lakehouse and recommended best practices from the field.
☆22 · Oct 15, 2024 · Updated last year
akashmehta10 / profiling_pyspark
☆25 · Jul 9, 2023 · Updated 3 years ago
yeasy / seminar-talk
Some seminar talk slides
☆19 · Mar 5, 2020 · Updated 6 years ago
noi-techpark / big-data-for-tourism
☆12 · Jul 22, 2025 · Updated last year