josephmachado / sde_de101_josephmachadoView external linksLinks
Sample repo for startdataengineering DE 101 free course
☆74Jun 24, 2024Updated last year
Alternatives and similar repositories for sde_de101_josephmachado
Users that are interested in sde_de101_josephmachado are comparing it to the libraries listed below
Sorting:
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆98Jun 7, 2024Updated last year
- Code for "Advanced data transformations in SQL" free live workshop☆89May 5, 2025Updated 9 months ago
- Cost Efficient Data Pipelines with DuckDB☆61May 14, 2025Updated 9 months ago
- Code for DE101 book at https://de101.startdataengineering.com/☆83Dec 6, 2025Updated 2 months ago
- Step by step instructions to create a production-ready data pipeline☆58Dec 23, 2024Updated last year
- Code for data quality with greatexpectations blog☆13Jul 30, 2024Updated last year
- Repo for CDC with debezium blog post☆29Sep 15, 2024Updated last year
- Repository for Data Engineering Interview Series☆36Oct 17, 2024Updated last year
- Process manager and website for hosting multiple Streamlit apps☆14Jun 28, 2023Updated 2 years ago
- A custom end-to-end analytics platform for customer churn☆11May 15, 2025Updated 9 months ago
- Daily updated fake data for DBT learning and projects☆35Jan 7, 2024Updated 2 years ago
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow☆34Oct 18, 2020Updated 5 years ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆286Jul 11, 2024Updated last year
- ☆21Oct 21, 2024Updated last year
- Code for "Efficient Data Processing in Spark" Course☆362Oct 16, 2025Updated 4 months ago
- End to end data engineering project☆58Oct 27, 2022Updated 3 years ago
- Example repo to create end to end tests for data pipeline.☆25Jun 14, 2024Updated last year
- ☆19Feb 25, 2022Updated 3 years ago
- Primary repository for NYC DCP's Data Engineering team☆33Updated this week
- Beyond Vibe Coding. Code, Planning, Documentation and Product Management agents.☆70Jun 16, 2025Updated 8 months ago
- Realtime social project with laravel, vuejs and pusher☆11Nov 24, 2018Updated 7 years ago
- Upload of all my presentations which I've been doing in the past☆10Feb 5, 2026Updated last week
- ☆11Jan 5, 2023Updated 3 years ago
- Repository for the dbt Semantic Layer course☆11Nov 13, 2025Updated 3 months ago
- DBT and clickhouse test project with dagster☆12Aug 29, 2023Updated 2 years ago
- A web application to send single or group message to your contacts which is developed using CodeIgniter.☆12Nov 20, 2019Updated 6 years ago
- Kubernetes on AWS Workshop☆10Nov 3, 2017Updated 8 years ago
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with …☆15Dec 27, 2023Updated 2 years ago
- datasets from my cyber security research papers☆10Jan 12, 2021Updated 5 years ago
- TSE-Simulator according to BSI TR-03153☆10Apr 21, 2021Updated 4 years ago
- This project sets up a real-time data pipeline utilizing Change Data Capture (CDC) to stream changes from a PostgreSQL database to a Clic…☆12May 9, 2024Updated last year
- A website with php as a backend to manage blood bank and many other functionalities.☆11Sep 14, 2019Updated 6 years ago
- A Python package extending pandas with helper functions for simpler exploratory data analysis and data wrangling.☆10Feb 6, 2025Updated last year
- A script/docker that automatically translates PDFs using the DeepL API☆11Jan 18, 2026Updated 3 weeks ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆44Jan 4, 2024Updated 2 years ago
- dbt tutorial using a local PostgreSQL database☆39Jun 4, 2022Updated 3 years ago
- This project implements an ELT (Extract - Load - Transform) data pipeline with the goodreads dataset, using dagster (orchestration), spar…☆42Apr 22, 2023Updated 2 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, duckdb and Superset☆46Dec 13, 2025Updated 2 months ago
- This repository goes over how to handle massive variety in data engineering☆314Jan 16, 2023Updated 3 years ago