This is a repo with links to everything you'd ever want to learn about data engineering
☆41,171Apr 2, 2026Updated last month
Alternatives and similar repositories for data-engineer-handbook
Users that are interested in data-engineer-handbook are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Jo…☆40,481Apr 20, 2026Updated 2 weeks ago
- This repository helps teach people how to correctly define and create cumulative tables!☆762Oct 29, 2024Updated last year
- A curated list of data engineering tools for software developers☆8,582Apr 5, 2026Updated 3 weeks ago
- Data Engineering Practice Problems☆2,653Jan 8, 2025Updated last year
- This is a public repository to go over all the LLM-driven data engineering concepts.☆1,148Oct 26, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The best place to learn data engineering. Built and maintained by the data engineering community.☆1,926Apr 6, 2026Updated 3 weeks ago
- An Awesome List of Open-Source Data Engineering Projects☆3,166Oct 4, 2024Updated last year
- The Data Engineering Cookbook☆15,071Jan 17, 2026Updated 3 months ago
- All the resources you need to get to Senior Engineer and beyond☆17,333Apr 18, 2026Updated 2 weeks ago
- A list of useful resources to learn Data Engineering from scratch☆3,990Jun 19, 2024Updated last year
- 21 Lessons, Get Started Building with Generative AI☆110,167Updated this week
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆78,872Feb 5, 2026Updated 2 months ago
- ☆1,064Dec 31, 2025Updated 4 months ago
- This is a repo with links to everything you'd ever want to learn about data engineering☆801Dec 18, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This repo has all the resources you need to become an amazing analytics engineer!☆340Feb 17, 2026Updated 2 months ago
- 100+ AI Agent & RAG apps you can actually run — clone, customize, ship.☆108,113Apr 27, 2026Updated last week
- Roadmap to becoming a data engineer in 2021☆12,754Jan 25, 2022Updated 4 years ago
- Free MLOps course from DataTalks.Club☆14,519Dec 1, 2025Updated 5 months ago
- A guide for technical professionals looking to start consulting☆1,527Jan 3, 2025Updated last year
- Implementing best practices for PySpark ETL jobs and applications.☆2,097Jan 1, 2023Updated 3 years ago
- List of books, blogs, newsletters and people!☆6,141Mar 31, 2026Updated last month
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆28,150Sep 30, 2025Updated 7 months ago
- Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.☆346,793Mar 20, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This repository goes over how to handle massive variety in data engineering☆320Jan 16, 2023Updated 3 years ago
- Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.☆82,414Apr 4, 2025Updated last year
- Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. …☆34,344Mar 26, 2026Updated last month
- Anthropic's educational courses☆20,979Nov 13, 2025Updated 5 months ago
- Learn ML engineering for free in 4 months! Register here 👇🏼☆13,001Dec 27, 2025Updated 4 months ago
- A one stop repository for generative AI research updates, interview resources, notebooks and much more!☆26,428Apr 24, 2026Updated last week
- If you want to become good at system design, join this newsletter now 👇☆24,313Updated this week
- In-depth tutorials on LLMs, RAGs and real-world AI agent applications.☆34,407Mar 23, 2026Updated last month
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆91,680Apr 16, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 🙌 OpenHands: AI-Driven Development☆72,542Updated this week
- More than 2000+ Data engineer interview questions.☆1,586Jan 13, 2026Updated 3 months ago
- Master programming by recreating your favorite technologies from scratch.☆498,564Feb 21, 2026Updated 2 months ago
- Get your documents ready for gen AI☆59,087Updated this week
- Free, simple, and intuitive online database diagram editor and SQL generator.☆37,127Apr 22, 2026Updated last week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆64,964Updated this week
- Run agents as production software.☆39,835Updated this week