Code for DE101 book at https://de101.startdataengineering.com/
☆91Feb 22, 2026Updated 3 weeks ago
Alternatives and similar repositories for data_engineering_for_beginners_code
Users that are interested in data_engineering_for_beginners_code are comparing it to the libraries listed below
Sorting:
- ☆15Mar 29, 2024Updated last year
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆40Apr 29, 2024Updated last year
- ☆31Oct 23, 2025Updated 4 months ago
- ☆10Mar 19, 2023Updated 3 years ago
- End to End Data engineering projects in Google cloud environment☆27Nov 17, 2025Updated 4 months ago
- ☆26Sep 28, 2023Updated 2 years ago
- ☆21Aug 8, 2024Updated last year
- Repository to accompany the PHP API Pro course☆18Jul 3, 2024Updated last year
- Cloud Functions streaming insert to BigQuery (with Cloud Pub/Sub trigger). In this example, the function will make a REST API call to get…☆28Aug 28, 2023Updated 2 years ago
- In this repository we store all materials for dlt workshops, courses, etc.☆255Mar 11, 2026Updated last week
- Source code of webpro.nl☆11Oct 12, 2025Updated 5 months ago
- Code base for CDE bootcamp☆74Jan 17, 2026Updated 2 months ago
- Face Recognition Using CNN in Real-Time Videos☆22Feb 14, 2025Updated last year
- ☆19Feb 25, 2022Updated 4 years ago
- Data Engineering project using Databricks PySpark & Spark SQL for analysing data from Spotify API and present in form of PowerBI report☆42Nov 26, 2025Updated 3 months ago
- An end-to-end, containerized data pipeline for near-real-time user event analytics using Kafka, ClickHouse, Airflow, and PySpark. Made to…☆56Sep 12, 2025Updated 6 months ago
- ☆11Feb 13, 2019Updated 7 years ago
- Firefox extension that shows parquet schema when going over GCP cloud storage. Use DuckDB WASM☆12Jan 19, 2024Updated 2 years ago
- The basis of this project involves analyzing Amgen future profitability based on its current business environment and financial performan…☆12Jul 26, 2019Updated 6 years ago
- A lightweight and flexible analysis pipeline☆12Jan 22, 2026Updated last month
- A Python package extending pandas with helper functions for simpler exploratory data analysis and data wrangling.☆10Feb 6, 2025Updated last year
- Open Data Stack Projects: Examples of End to End Data Engineering Projects☆91Jun 25, 2023Updated 2 years ago
- Integrating Apache Airflow, dbt, Great Expectations and Apache Superset to develop a modern open source data stack.☆16Jun 19, 2022Updated 3 years ago
- Recommendations for Power BI Service Administration☆21Nov 8, 2022Updated 3 years ago
- Data pipeline to build a data warehouse on Postgres☆14Aug 11, 2024Updated last year
- ☆24Feb 20, 2026Updated last month
- ☆22Jul 27, 2025Updated 7 months ago
- ☆10Jan 24, 2023Updated 3 years ago
- GitHub mirror of Metadata indexer☆16Mar 5, 2026Updated 2 weeks ago
- Building a Data Pipeline with an Open Source Stack☆57Jun 27, 2025Updated 8 months ago
- Fake Pandas / PySpark DataFrame creator☆48Mar 10, 2024Updated 2 years ago
- My dotfiles☆14Updated this week
- Parses a valid YAML string into a struct which implements the DeserializeOwned trait from serde☆17Aug 9, 2025Updated 7 months ago
- Examples, samples and write ups to help educate and accelerate development and adoption of power platform including Canvas Apps, Model Ap…☆34Jan 18, 2024Updated 2 years ago
- This Repo contains tools that allow us to import, clean, manipulate, and visualize data —Includes Python libraries, like pandas, NumPy, M…☆13Jul 7, 2024Updated last year
- Design and implementation of FAIR Data Cube☆11Jun 2, 2025Updated 9 months ago
- An MOOC offered by the University of Helsinki. Course information can be found below☆10Jun 10, 2021Updated 4 years ago
- This is the HTML-CSS source code to build my personal website.☆10Nov 13, 2025Updated 4 months ago
- Transform your private channel to a public one with all your history☆13Apr 28, 2022Updated 3 years ago