karlchris / data-engineering
Data Engineering Handbook for beginners and everyone
☆30Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for data-engineering
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆241Updated 4 months ago
- where geeks hangout and discuss about data engineering☆39Updated last year
- Sample project to demonstrate data engineering best practices☆166Updated 8 months ago
- Code for "Efficient Data Processing in Spark" Course☆247Updated last month
- Django-based course management platform for Zoomcamps☆54Updated 2 weeks ago
- This is a template you can use for your next data engineering portfolio project.☆163Updated 3 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆133Updated 4 years ago
- End to end data engineering project☆51Updated 2 years ago
- Data Engineering examples for Airflow, Prefect, and Mage.ai; dbt for BigQuery, Redshift, ClickHouse, PostgreSQL; Spark/PySpark for Batch …☆51Updated last week
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆55Updated 5 months ago
- ☆128Updated last year
- Sample repo for startdataengineering DE 101 free course☆35Updated 4 months ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆67Updated 3 months ago
- Recohut - Learn data engineering, data science☆93Updated last year
- Data pipeline for uploading, preprocessing, and visualising COVID19 data☆17Updated last year
- This repository will contain all of the resources for the Mage component of the Data Engineering Zoomcamp: https://github.com/DataTalksCl…☆96Updated 3 months ago
- FInal project for data zoom camp 2024☆18Updated 7 months ago
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster☆424Updated last month
- Source code of the Apache Airflow Tutorial for Beginners on YouTube Channel Coder2j (https://www.youtube.com/c/coder2j)☆273Updated 8 months ago
- My notes of the Data Engineering Zoomcamp by DataTalksClub☆32Updated last year
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆70Updated 6 months ago
- ☆15Updated 9 months ago
- Data Engineering with Google Cloud Platform, published by Packt☆109Updated last year
- Classwork projects and home works done through Udacity data engineering nano degree☆74Updated 11 months ago
- Code for "Advanced data transformations in SQL" free live workshop☆66Updated last month
- Code for dbt tutorial☆143Updated 5 months ago
- This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs.☆39Updated 3 months ago
- ☆445Updated last month
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆204Updated last year