Code for DE101 book at https://de101.startdataengineering.com/
☆105Feb 22, 2026Updated 2 months ago
Alternatives and similar repositories for data_engineering_for_beginners_code
Users that are interested in data_engineering_for_beginners_code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official ClickHouse Agentic Data Stack - self-host with ClickHouse, LibreChat, Langfuse, and ClickHouse MCP.☆61May 13, 2026Updated last week
- ☆15Mar 29, 2024Updated 2 years ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆40Apr 29, 2024Updated 2 years ago
- A framework to manage data, continuously☆35Jan 20, 2025Updated last year
- ☆10Mar 19, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Sample repo for startdataengineering DE 101 free course☆74Jun 24, 2024Updated last year
- All the ressources and guide to practice the Patou Tips☆34May 3, 2026Updated 2 weeks ago
- ☆26Sep 28, 2023Updated 2 years ago
- Repository to accompany the Test Driven Development with Pest course☆13Nov 23, 2023Updated 2 years ago
- ☆21Aug 8, 2024Updated last year
- Minimalistic implementation for LLM-based chat assistants with Tool Use (function calling) and MCP☆30Apr 21, 2026Updated last month
- Cloud Functions streaming insert to BigQuery (with Cloud Pub/Sub trigger). In this example, the function will make a REST API call to get…☆28Aug 28, 2023Updated 2 years ago
- Source code of webpro.nl☆11Oct 12, 2025Updated 7 months ago
- Face Recognition Using CNN in Real-Time Videos☆22Feb 14, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- My old blog is too hard to maintain. So, why not make a new one?☆15Jul 5, 2023Updated 2 years ago
- ☆19Feb 25, 2022Updated 4 years ago
- Data Engineering project using Databricks PySpark & Spark SQL for analysing data from Spotify API and present in form of PowerBI report☆51Nov 26, 2025Updated 5 months ago
- ☆11Feb 13, 2019Updated 7 years ago
- Download closed captions from youtube videos (both manual and automatically generated), python implementation☆17Jan 1, 2022Updated 4 years ago
- A script/docker that automatically translates PDFs using the DeepL API☆13Updated this week
- Moving to Microsoft Teams from Slack or starting fresh? You've come to the right place.☆14Jan 7, 2025Updated last year
- A Python package extending pandas with helper functions for simpler exploratory data analysis and data wrangling.☆10Feb 6, 2025Updated last year
- A lightweight and flexible analysis pipeline☆12May 14, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆90Jun 25, 2023Updated 2 years ago
- Data Analysis and Image Processing Python Course☆12Nov 4, 2014Updated 11 years ago
- Relative paths according to the CURRENT FILE, not the current shell location.☆23Apr 11, 2024Updated 2 years ago
- Data pipeline to build a data warehouse on Postgres☆15Aug 11, 2024Updated last year
- A notebook testing CPU speed vs GPU speed with Pytorch and CUDA☆18Dec 25, 2021Updated 4 years ago
- Apache Ignite Quick Start Guide, published by Packt☆12Jan 30, 2023Updated 3 years ago
- create issues from pytest-reportlog files☆14Feb 10, 2026Updated 3 months ago
- ☆22Jul 27, 2025Updated 9 months ago
- iqair dataset☆41Apr 24, 2026Updated 3 weeks ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Building a Data Pipeline with an Open Source Stack☆59Jun 27, 2025Updated 10 months ago
- My dotfiles☆14Updated this week
- LLM extensions for Sphinx Documentation☆21Apr 2, 2026Updated last month
- This Repo contains tools that allow us to import, clean, manipulate, and visualize data —Includes Python libraries, like pandas, NumPy, M…☆13Jul 7, 2024Updated last year
- Design and implementation of FAIR Data Cube☆11Jun 2, 2025Updated 11 months ago
- An MOOC offered by the University of Helsinki. Course information can be found below☆10Jun 10, 2021Updated 4 years ago
- DuckDB WebMacro: Share and Load your SQL Macros via gists☆15Mar 24, 2026Updated last month