Code for DE101 book at https://de101.startdataengineering.com/
☆95Feb 22, 2026Updated last month
Alternatives and similar repositories for data_engineering_for_beginners_code
Users that are interested in data_engineering_for_beginners_code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Mar 29, 2024Updated 2 years ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆40Apr 29, 2024Updated last year
- ☆10Mar 19, 2023Updated 3 years ago
- Sample repo for startdataengineering DE 101 free course☆74Jun 24, 2024Updated last year
- All the ressources and guide to practice the Patou Tips☆30Updated this week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Repository for the D ONE MLOps AWS BlogPost☆11Aug 13, 2024Updated last year
- ☆26Sep 28, 2023Updated 2 years ago
- ☆82Aug 22, 2024Updated last year
- ☆21Aug 8, 2024Updated last year
- ☆12Dec 7, 2023Updated 2 years ago
- An expert system using knowledge graphs that aims to provide the patients with medical advice and basic knowledge on various diseases☆16May 4, 2020Updated 5 years ago
- In this repository we store all materials for dlt workshops, courses, etc.☆256Mar 11, 2026Updated last month
- ☆19Feb 25, 2022Updated 4 years ago
- Firefox extension that shows parquet schema when going over GCP cloud storage. Use DuckDB WASM☆12Jan 19, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Data Engineering project using Databricks PySpark & Spark SQL for analysing data from Spotify API and present in form of PowerBI report☆47Nov 26, 2025Updated 4 months ago
- A script/docker that automatically translates PDFs using the DeepL API☆12Jan 18, 2026Updated 2 months ago
- A Python package extending pandas with helper functions for simpler exploratory data analysis and data wrangling.☆10Feb 6, 2025Updated last year
- Integrating Apache Airflow, dbt, Great Expectations and Apache Superset to develop a modern open source data stack.☆16Jun 19, 2022Updated 3 years ago
- Data Analysis and Image Processing Python Course☆12Nov 4, 2014Updated 11 years ago
- Data pipeline to build a data warehouse on Postgres☆15Aug 11, 2024Updated last year
- ☆25Apr 4, 2026Updated last week
- ☆10Jan 24, 2023Updated 3 years ago
- GitHub mirror of Metadata indexer☆16Mar 17, 2026Updated 3 weeks ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Building a Data Pipeline with an Open Source Stack☆58Jun 27, 2025Updated 9 months ago
- The "World Data Report" is a Power BI project that offers a detailed overview of global data, covering weather, geographical, demographic…☆15Nov 30, 2025Updated 4 months ago
- Fake Pandas / PySpark DataFrame creator☆48Mar 10, 2024Updated 2 years ago
- An MOOC offered by the University of Helsinki. Course information can be found below☆10Jun 10, 2021Updated 4 years ago
- This is the HTML-CSS source code to build my personal website.☆10Nov 13, 2025Updated 4 months ago
- An end-to-end, containerized data pipeline for near-real-time user event analytics using Kafka, ClickHouse, Airflow, and PySpark. Made to…☆57Sep 12, 2025Updated 6 months ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆103Jun 7, 2024Updated last year
- ☆23Feb 16, 2025Updated last year
- Sémantický slovník pojmů☆13Apr 2, 2026Updated last week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Repository for participants of the "Containers for HPC" training☆11Feb 11, 2026Updated 2 months ago
- A local-first, terminal-based password manager built for people who care about security, simplicity, and control☆37Dec 31, 2025Updated 3 months ago
- Roadmap for all those who want to get a kick start as Data Scientist.☆11Feb 2, 2022Updated 4 years ago
- Triplestore wrapper package for Python.☆13Updated this week
- ☆22Sep 26, 2021Updated 4 years ago
- Deploy a complete data stack in just a couple of minutes.☆15Mar 6, 2024Updated 2 years ago
- Identify and tokenize sensitive data automatically using Cloud DLP and Dataflow☆45Oct 27, 2025Updated 5 months ago