airscholar / modern-data-eng-dbt-databricks-azureLinks

In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our cloud provider.

☆35

Alternatives and similar repositories for modern-data-eng-dbt-databricks-azure

Users that are interested in modern-data-eng-dbt-databricks-azure are comparing it to the libraries listed below

Sorting:

airscholar / realtime-voting-data-engineering
This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…
☆42Updated last year
HamzaG737 / data-engineering-project
End to end data engineering project with kafka, airflow, spark, postgres and docker.
☆102Updated 7 months ago
airscholar / e2e-data-engineering
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…
☆285Updated 8 months ago
airscholar / RealtimeStreamingEngineering
This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…
☆44Updated last year
afaqueahmad7117 / spark-experiments
Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews
☆170Updated last month
airscholar / RedditDataEngineering
This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…
☆159Updated 2 years ago
josephmachado / data_engineering_best_practices
Sample project to demonstrate data engineering best practices
☆197Updated last year
raveendratal / PysparkRaveendra
Git Repository
☆147Updated last month
josephmachado / online_store
End to end data engineering project
☆57Updated 3 years ago
SatadruMukherjee / Data-Preprocessing-Models
☆70Updated this week
LearningJournal / Apache-Spark-and-Databricks-Stream-Processing-in-Lakehouse
☆56Updated last year
darshilparmar / twitter-airflow-data-engineering-project
YouTube tutorial project
☆105Updated 2 years ago
Amrit-Hub / Databricks-Certified-Data-Engineer-Professional-Questions
This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs.
☆117Updated last year
alanchn31 / Movalytics-Data-Warehouse
Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow
☆157Updated 5 years ago
martandsingh / ApacheSpark
This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…
☆102Updated last month
simardeep1792 / Data-Engineering-Streaming-Project
☆44Updated last year
josephmachado / python_essentials_for_data_engineers
Code for blog at https://www.startdataengineering.com/post/python-for-de/
☆87Updated last year
josephmachado / adv_data_transformation_in_sql
Code for "Advanced data transformations in SQL" free live workshop
☆84Updated 5 months ago
derar-alhussein / Databricks-Certified-Data-Engineer-Professional
The resources of the preparation course for Databricks Data Engineer Professional certification exam
☆142Updated 4 months ago
Snowflake-Labs / sfguide-data-engineering-with-snowpark-python
☆140Updated 8 months ago
raashidsalih / churn-pipeline
A custom end-to-end analytics platform for customer churn
☆11Updated 5 months ago
kroudir / Data-Engineer-Nanodegree-Projects-Udacity
Projects done in the Data Engineer Nanodegree Program by Udacity.com
☆164Updated 2 years ago
airscholar / changecapture-e2e
This project shows how to capture changes from postgres database and stream them into kafka
☆38Updated last year
abdkumar / spotify-stream-analytics
Generate synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consu…
☆69Updated last year
airscholar / Kubernetes-For-DataEngineering
This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…
☆22Updated last year
josephmachado / beginner_de_project_stream
Simple stream processing pipeline
☆110Updated last year
josephmachado / sde_de101_josephmachado
Sample repo for startdataengineering DE 101 free course
☆69Updated last year
derar-alhussein / Databricks-Certified-Data-Engineer-Associate
The resources of the preparation course for Databricks Data Engineer Associate certification exam
☆507Updated last month
RSKriegs / finnhub-streaming-data-pipeline
Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more
☆365Updated last year
itversity / data-engineering-spark
☆88Updated 3 years ago