vajol / python-data-engineering-resources
A handpicked collection of resources for Python developers in data engineering, machine learning, and AI. Inside, you'll discover a neatly arranged selection of frameworks, libraries, and tools crucial for machine learning, ETL, ORM, data/schema validation, database migration, and more, all focused on Python.
☆101Updated last year
Alternatives and similar repositories for python-data-engineering-resources
Users that are interested in python-data-engineering-resources are comparing it to the libraries listed below
Sorting:
- Sample project to demonstrate data engineering best practices☆190Updated last year
- Code for "Advanced data transformations in SQL" free live workshop☆81Updated last week
- ☆144Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆76Updated 11 months ago
- Code for dbt tutorial☆157Updated 11 months ago
- In this repository we store all materials for dlt workshops, courses, etc.☆165Updated 3 weeks ago
- Some example projects for Data Engineers to build, end-to-end.☆29Updated last year
- Django-based course management platform for Zoomcamps☆67Updated this week
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 9 months ago
- End to end data engineering project☆54Updated 2 years ago
- Template for Data Engineering and Data Pipeline projects☆110Updated 2 years ago
- ☆130Updated 3 months ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆37Updated last year
- Data Modeling with Snowflake, published by Packt☆65Updated last month
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence☆197Updated last week
- ☆117Updated 9 months ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆261Updated 10 months ago
- Code to demonstrate data engineering metadata & logging best practices☆16Updated last year
- ☆139Updated 2 years ago
- Step by step instructions to create a production-ready data pipeline☆50Updated 4 months ago
- Repository for Data Engineering Zoomcamp 2024☆14Updated last year
- Building ETL Pipelines with Python☆138Updated 10 months ago
- This repository will contain all of the resources for the Mage component of the Data Engineering Zoomcamp: https://github.com/DataTalksCl…☆98Updated 8 months ago
- ☆12Updated last year
- Project for "Data pipeline design patterns" blog.☆45Updated 9 months ago
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆141Updated 9 months ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆70Updated 7 months ago
- Code for "Efficient Data Processing in Spark" Course☆299Updated 7 months ago
- Data engineering with dbt, published by Packt☆77Updated last year
- Slow & local data allows you to move fast and deliver business value for the 99.9% of the data challenges.☆225Updated last month