vajol / python-data-engineering-resourcesLinks
A handpicked collection of resources for Python developers in data engineering, machine learning, and AI. Inside, you'll discover a neatly arranged selection of frameworks, libraries, and tools crucial for machine learning, ETL, ORM, data/schema validation, database migration, and more, all focused on Python.
☆106Updated last year
Alternatives and similar repositories for python-data-engineering-resources
Users that are interested in python-data-engineering-resources are comparing it to the libraries listed below
Sorting:
- Some example projects for Data Engineers to build, end-to-end.☆34Updated last year
- Code for "Efficient Data Processing in Spark" Course☆340Updated 4 months ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆87Updated last year
- In this repository we store all materials for dlt workshops, courses, etc.☆229Updated last month
- Sample project to demonstrate data engineering best practices☆198Updated last year
- Code for "Advanced data transformations in SQL" free live workshop☆84Updated 5 months ago
- ☆145Updated last year
- Code for my "Efficient Data Processing in SQL" book.☆59Updated last year
- Code snippets for Data Engineering Design Patterns book☆207Updated 6 months ago
- Code for dbt tutorial☆161Updated last month
- Template for Data Engineering and Data Pipeline projects☆115Updated 2 years ago
- ☆210Updated 8 months ago
- Django-based course management platform for Zoomcamps☆69Updated last week
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆279Updated last year
- Practical Data Engineering: A Hands-On Real-Estate Project Guide☆709Updated last year
- End to end data engineering project☆57Updated 2 years ago
- This repo has all the resources you need to become an amazing analytics engineer!☆263Updated last year
- ☆120Updated 2 months ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆40Updated last year
- Building ETL Pipelines with Python☆162Updated last year
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects☆221Updated 5 months ago
- ☆190Updated 4 years ago
- A demonstration of an ELT (Extract, Load, Transform) pipeline☆30Updated last year
- This repository goes over how to handle massive variety in data engineering☆303Updated 2 years ago
- A Data Engineering project. Repository for backend infrastructure and Streamlit app files for a Premier League Dashboard.☆247Updated last year
- Data Engineering with Databricks Cookbook, published by Packt☆107Updated last year
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆150Updated last year
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.☆78Updated 2 years ago
- Airflow 3 demos from DevRel☆73Updated 2 months ago
- Data engineering with dbt, published by Packt☆87Updated last month