vajol / python-data-engineering-resources
A handpicked collection of resources for Python developers in data engineering, machine learning, and AI. Inside, you'll discover a neatly arranged selection of frameworks, libraries, and tools crucial for machine learning, ETL, ORM, data/schema validation, database migration, and more, all focused on Python.
☆106 Updated last year
Alternatives and similar repositories for python-data-engineering-resources
Users who are interested in python-data-engineering-resources are comparing it to the repositories listed below
- Code for blog at https://www.startdataengineering.com/post/python-for-de/ ☆85 Updated last year
- Code for "Advanced data transformations in SQL" free live workshop ☆83 Updated 3 months ago
- Some example projects for Data Engineers to build, end-to-end. ☆34 Updated last year
- Code for my "Efficient Data Processing in SQL" book. ☆58 Updated last year
- In this repository we store all materials for dlt workshops, courses, etc. ☆219 Updated 2 weeks ago
- ☆206 Updated 7 months ago
- Sample project to demonstrate data engineering best practices ☆197 Updated last year
- Django-based course management platform for Zoomcamps ☆68 Updated 2 weeks ago
- Code snippets for Data Engineering Design Patterns book ☆150 Updated 5 months ago
- Code for DE101 book at https://de101.startdataengineering.com/ ☆49 Updated 3 weeks ago
- Code for dbt tutorial ☆159 Updated 2 months ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/ ☆39 Updated last year
- End to end data engineering project ☆57 Updated 2 years ago
- ☆145 Updated last year
- Repository for Data Engineering Interview Series ☆31 Updated 10 months ago
- Template for Data Engineering and Data Pipeline projects ☆114 Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up ☆56 Updated 4 years ago
- Practical Data Engineering: A Hands-On Real-Estate Project Guide ☆692 Updated 11 months ago
- A demonstration of an ELT (Extract, Load, Transform) pipeline ☆30 Updated last year
- AWS ETL Pipeline ☆30 Updated last year
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing ☆274 Updated last year
- Code for "Efficient Data Processing in Spark" Course ☆336 Updated 3 months ago
- end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence ☆219 Updated 2 months ago
- This repo has all the resources you need to become an amazing analytics engineer! ☆253 Updated last year
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces. ☆76 Updated 2 years ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ☆219 Updated 4 months ago
- This is a code repository for the course Data Engineering with Data Build Tool (DBT). ☆61 Updated last year
- ☆135 Updated last week
- ☆12 Updated last year
- Get started with dbt in less than 1 minute from `git clone` to `dbt docs serve` for free! ☆221 Updated 3 months ago