Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here
☆ 38,735 · Updated this week
Alternatives and similar repositories for data-engineering-zoomcamp
Users who are interested in data-engineering-zoomcamp are comparing it to the libraries listed below
- Free MLOps course from DataTalks.Club · ☆ 14,259 · Dec 1, 2025 · Updated 3 months ago
- Learn ML engineering for free in 4 months! Register here · ☆ 12,671 · Dec 27, 2025 · Updated 2 months ago
- This is a repo with links to everything you'd ever want to learn about data engineering · ☆ 40,293 · Dec 15, 2025 · Updated 2 months ago
- LLM Zoomcamp - a free online course about real-life applications of LLMs. In 10 weeks you will learn how to build an AI system that answe… · ☆ 4,642 · Dec 1, 2025 · Updated 3 months ago
- Data Engineering Practice Problems · ☆ 2,547 · Jan 8, 2025 · Updated last year
- The Data Engineering Cookbook · ☆ 14,959 · Jan 17, 2026 · Updated last month
- A list of useful resources to learn Data Engineering from scratch · ☆ 3,952 · Jun 19, 2024 · Updated last year
- A curated list of data engineering tools for software developers · ☆ 8,325 · Feb 21, 2026 · Updated last week
- Roadmap to becoming a data engineer in 2021 · ☆ 12,745 · Jan 25, 2022 · Updated 4 years ago
- An Awesome List of Open-Source Data Engineering Projects · ☆ 3,020 · Oct 4, 2024 · Updated last year
- The best place to learn data engineering. Built and maintained by the data engineering community. · ☆ 1,897 · Jan 31, 2026 · Updated last month
- Course Materials for Analytics in Stock Markets Zoomcamp · ☆ 828 · Oct 4, 2025 · Updated 4 months ago
- Personal Data Engineering Projects · ☆ 993 · Feb 8, 2023 · Updated 3 years ago
- Example end to end data engineering project. · ☆ 1,387 · Dec 8, 2022 · Updated 3 years ago
- Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake developme… · ☆ 1,826 · Aug 26, 2022 · Updated 3 years ago
- ☆ 371 · May 8, 2023 · Updated 2 years ago
- Papers & tech blogs by companies sharing their work on data science & machine learning in production. · ☆ 28,698 · Jul 18, 2024 · Updated last year
- 10 Weeks, 20 Lessons, Data Science for All! · ☆ 34,014 · Updated this week
- Learn by doing: DIY project groups at DataTalks.Club · ☆ 414 · May 24, 2024 · Updated last year
- More than 2000+ Data engineer interview questions. · ☆ 1,524 · Jan 13, 2026 · Updated last month
- 12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all · ☆ 84,015 · Updated this week
- Discover the latest machine learning / AI courses on YouTube. · ☆ 17,101 · Jan 22, 2024 · Updated 2 years ago
- Build, run, and manage data pipelines for integrating and transforming data. · ☆ 8,653 · Feb 20, 2026 · Updated last week
- A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more! · ☆ 842 · Apr 16, 2022 · Updated 3 years ago
- A curated list of references for MLOps · ☆ 13,714 · Nov 21, 2024 · Updated last year
- Data science interview questions and answers · ☆ 9,779 · Feb 19, 2026 · Updated last week
- Implementing best practices for PySpark ETL jobs and applications. · ☆ 2,074 · Jan 1, 2023 · Updated 3 years ago
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks. · ☆ 75,460 · Feb 5, 2026 · Updated 3 weeks ago
- Learn how to design, develop, deploy and iterate on production-grade ML applications. · ☆ 46,426 · Aug 18, 2024 · Updated last year
- ☆ 8,653 · Sep 22, 2024 · Updated last year
- data load tool (dlt) is an open source Python library that makes data loading easy · ☆ 4,949 · Updated this week
- Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elasti… · ☆ 81,221 · Dec 27, 2025 · Updated 2 months ago
- A ranked list of awesome machine learning Python libraries. Updated weekly. · ☆ 23,250 · Updated this week
- List of Computer Science courses with video lectures. · ☆ 75,235 · Feb 21, 2026 · Updated last week
- An awesome Data Science repository to learn and apply for real world problems. · ☆ 28,451 · Updated this week
- Learn how to design, develop, deploy and iterate on production-grade ML applications. · ☆ 3,300 · Aug 16, 2024 · Updated last year
- Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards. · ☆ 336,588 · Nov 3, 2025 · Updated 3 months ago
- All Algorithms implemented in Python · ☆ 218,211 · Feb 2, 2026 · Updated 3 weeks ago
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python. · ☆ 21,652 · Feb 21, 2026 · Updated last week