vajol / python-data-engineering-resources
A handpicked collection of resources for Python developers in data engineering, machine learning, and AI. Inside, you'll discover a neatly arranged selection of frameworks, libraries, and tools crucial for machine learning, ETL, ORM, data/schema validation, database migration, and more, all focused on Python.
☆99Updated last year
Alternatives and similar repositories for python-data-engineering-resources:
Users that are interested in python-data-engineering-resources are comparing it to the libraries listed below
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆74Updated 10 months ago
- Code for "Advanced data transformations in SQL" free live workshop☆77Updated 6 months ago
- Sample project to demonstrate data engineering best practices☆186Updated last year
- Django-based course management platform for Zoomcamps☆64Updated 3 weeks ago
- Some example projects for Data Engineers to build, end-to-end.☆28Updated last year
- ☆144Updated last year
- Code for dbt tutorial☆156Updated 10 months ago
- Step by step instructions to create a production-ready data pipeline☆45Updated 4 months ago
- A demonstration of an ELT (Extract, Load, Transform) pipeline☆29Updated last year
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆36Updated 11 months ago
- Code for my "Efficient Data Processing in SQL" book.☆56Updated 8 months ago
- ☆204Updated 3 months ago
- In this repository we store all materials for dlt workshops, courses, etc.☆155Updated this week
- Project for "Data pipeline design patterns" blog.☆45Updated 8 months ago
- Template for Data Engineering and Data Pipeline projects☆109Updated 2 years ago
- Code to demonstrate data engineering metadata & logging best practices☆16Updated last year
- Sample repo for startdataengineering DE 101 free course☆58Updated 10 months ago
- End to end data engineering project☆54Updated 2 years ago
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.☆67Updated last year
- Code snippets for Data Engineering Design Patterns book☆80Updated last month
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated last year
- ☆22Updated last year
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆27Updated 2 years ago
- Building ETL Pipelines with Python☆132Updated 9 months ago
- Repo for CDC with debezium blog post☆28Updated 7 months ago
- ☆16Updated 11 months ago
- Full stack data engineering tools and infrastructure set-up☆51Updated 4 years ago
- Cost Efficient Data Pipelines with DuckDB☆51Updated 8 months ago
- Dagster University courses☆76Updated last week
- ☆151Updated 2 years ago