This is a repo with links to everything you'd ever want to learn about data engineering
☆799 · Dec 18, 2024 · Updated last year
Alternatives and similar repositories for data-engineer-handbook-0326
Users who are interested in data-engineer-handbook-0326 are comparing it to the repositories listed below
- This is a repo with links to everything you'd ever want to learn about data engineering · ☆40,530 · Feb 26, 2026 · Updated 3 weeks ago
- This repository helps teach people how to correctly define and create cumulative tables! · ☆757 · Oct 29, 2024 · Updated last year
- Main repository to collect notes and scripts written during DataExpert.IO January 2025 bootcamp to help anyone interested. · ☆37 · Apr 8, 2025 · Updated 11 months ago
- Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Jo… · ☆39,165 · Mar 12, 2026 · Updated last week
- This repo has all the resources you need to become an amazing analytics engineer! · ☆319 · Feb 17, 2026 · Updated last month
- Local development environment for python data projects, with Docker · ☆23 · Dec 14, 2022 · Updated 3 years ago
- This is a public repository to go over all the LLM-driven data engineering concepts. · ☆1,134 · Oct 26, 2024 · Updated last year
- This uses the Polygon.io API to extract data about stocks · ☆35 · Sep 26, 2025 · Updated 5 months ago
- Databricks DLT Apparel Pipeline Project: Learn medallion architecture, streaming, and data engineering with Delta Live Tables. Includes s… · ☆40 · Feb 25, 2026 · Updated 3 weeks ago
- ☆18 · Jan 18, 2025 · Updated last year
- Solutions for Data Engineering Zoomcamp, Winter 2022. · ☆16 · Apr 22, 2022 · Updated 3 years ago
- A guide for technical professionals looking to start consulting · ☆1,514 · Jan 3, 2025 · Updated last year
- ☆10 · May 3, 2025 · Updated 10 months ago
- Course Materials for Analytics in Stock Markets Zoomcamp · ☆833 · Oct 4, 2025 · Updated 5 months ago
- Helper for handling PySpark DataFrame partition size 📑🎛️ · ☆12 · Mar 8, 2024 · Updated 2 years ago
- LLM Zoomcamp - a free online course about real-life applications of LLMs. In 10 weeks you will learn how to build an AI system that answe… · ☆4,722 · Dec 1, 2025 · Updated 3 months ago
- ☆115 · Jul 6, 2025 · Updated 8 months ago
- Capstone Project for DataExpert.io V4 Cohort · ☆13 · Jul 8, 2024 · Updated last year
- Capstone Project for the IBM Data Engineering Professional Certification. · ☆13 · Mar 7, 2022 · Updated 4 years ago
- Data Engineering Practice Problems · ☆2,574 · Jan 8, 2025 · Updated last year
- ☆21 · Jan 27, 2026 · Updated last month
- A portable Datamart and Business Intelligence suite built with Docker, Mage, dbt, DuckDB and Superset · ☆55 · Mar 9, 2026 · Updated last week
- Data engineering mentorship program · ☆287 · Aug 2, 2024 · Updated last year
- ☆23 · Apr 7, 2025 · Updated 11 months ago
- Hands-on examples and exercises from the book "Databricks Certified Data Engineer Associate Study Guide" published by O'Reilly Media. · ☆69 · Mar 25, 2025 · Updated 11 months ago
- ☆12 · Apr 8, 2025 · Updated 11 months ago
- ☆12 · Jul 6, 2017 · Updated 8 years ago
- More than 2000+ Data engineer interview questions. · ☆1,547 · Jan 13, 2026 · Updated 2 months ago
- ☆14 · Oct 10, 2025 · Updated 5 months ago
- capstone project for Dataengineer.io bootcamp Public Repo · ☆12 · Feb 20, 2024 · Updated 2 years ago
- An Awesome List of Open-Source Data Engineering Projects · ☆3,046 · Oct 4, 2024 · Updated last year
- Beginner data engineering project - batch edition · ☆567 · Mar 12, 2026 · Updated last week
- A curated list of Large Language Model resources, covering model training, serving, fine-tuning, and building LLM applications. · ☆4,745 · Aug 18, 2025 · Updated 7 months ago
- This repo demonstrates the development of a real-time data pipeline designed to ingest, process, and analyze stock market data. Using cut… · ☆49 · Sep 2, 2024 · Updated last year
- Personal Data Engineering Projects · ☆1,001 · Feb 8, 2023 · Updated 3 years ago
- ☆372 · May 8, 2023 · Updated 2 years ago
- Case study solutions for #8WeekSQLChallenge by Danny Ma. · ☆43 · Jan 9, 2023 · Updated 3 years ago
- Project for the YouTube channel 'Nerds sem estudos', which aims to present fundamental concepts of coding and… · ☆34 · Mar 3, 2025 · Updated last year
- This repo contains all the code used in the Python for Data Engineering Course · ☆349 · Apr 24, 2024 · Updated last year