Sample repo for startdataengineering DE 101 free course
☆74 · Jun 24, 2024 · Updated last year
Alternatives and similar repositories for sde_de101_josephmachado
Users that are interested in sde_de101_josephmachado are comparing it to the libraries listed below.
- Code for blog at https://www.startdataengineering.com/post/python-for-de/ ☆109 · May 26, 2026 · Updated 3 weeks ago
- Code for "Advanced data transformations in SQL" free live workshop ☆93 · May 5, 2025 · Updated last year
- Step by step instructions to create a production-ready data pipeline ☆62 · Dec 23, 2024 · Updated last year
- Beginner data engineering project - batch edition ☆581 · Apr 13, 2026 · Updated 2 months ago
- Code for DE101 book at https://de101.startdataengineering.com/ ☆110 · Feb 22, 2026 · Updated 3 months ago
- Repo for CDC with debezium blog post ☆29 · Sep 15, 2024 · Updated last year
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/ ☆13 · May 24, 2024 · Updated 2 years ago
- Repository for Data Engineering Interview Series ☆40 · Oct 17, 2024 · Updated last year
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow ☆34 · Oct 18, 2020 · Updated 5 years ago
- Code for data quality with greatexpectations blog ☆13 · Jul 30, 2024 · Updated last year
- Code for dbt tutorial ☆180 · Jun 4, 2026 · Updated 2 weeks ago
- ☆14 · Dec 11, 2023 · Updated 2 years ago
- ☆16 · Apr 26, 2024 · Updated 2 years ago
- Sample project to demonstrate data engineering best practices ☆220 · Feb 24, 2024 · Updated 2 years ago
- Process manager and website for hosting multiple Streamlit apps ☆13 · Jun 28, 2023 · Updated 2 years ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing ☆292 · Jul 11, 2024 · Updated last year
- A list of professional data projects to enrich your portfolio ☆59 · Apr 11, 2026 · Updated 2 months ago
- Daily updated fake data for DBT learning and projects ☆35 · Jan 7, 2024 · Updated 2 years ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/ ☆40 · Apr 29, 2024 · Updated 2 years ago
- Code for "Efficient Data Processing in Spark" Course ☆385 · May 25, 2026 · Updated 3 weeks ago
- A course in data warehouse ☆21 · Sep 27, 2025 · Updated 8 months ago
- ☆10 · May 3, 2025 · Updated last year
- ELT Data Pipeline implementation in Data Warehousing environment ☆30 · May 2, 2025 · Updated last year
- End to end data engineering project ☆59 · Oct 27, 2022 · Updated 3 years ago
- My notes of the Data Engineering Zoomcamp by DataTalksClub ☆39 · Apr 16, 2023 · Updated 3 years ago
- Deploy a complete data stack in just a couple of minutes. ☆15 · Mar 6, 2024 · Updated 2 years ago
- ☆22 · Oct 21, 2024 · Updated last year
- ETL using Python in Jupyter Notebook, loading CSV, cleaning data, and saving to SQL Database. ☆14 · Nov 17, 2020 · Updated 5 years ago
- A web extension for converting PUPSIS schedule to ICalendar (.ics), csv and json file. ☆16 · Mar 12, 2025 · Updated last year
- Example repo to create end to end tests for data pipeline. ☆25 · Jun 14, 2024 · Updated 2 years ago
- ☆14 · Jan 27, 2026 · Updated 4 months ago
- A simple playground for dbt with the sqlite connector ☆12 · May 22, 2022 · Updated 4 years ago
- Data Engineering Project to Extract and Process Solana Reddit Data ☆40 · Feb 3, 2024 · Updated 2 years ago
- Python repo for the XDK auto-generated code. ☆34 · Feb 28, 2026 · Updated 3 months ago
- use old jquery plugins with nextjs ☆10 · Jan 6, 2023 · Updated 3 years ago
- Lyrics Generator based on GPT-2 ☆10 · Jun 20, 2023 · Updated 2 years ago
- ☆18 · Mar 7, 2025 · Updated last year
- learning-by-doing data model built with dbt-core ☆17 · Apr 10, 2026 · Updated 2 months ago
- This project implements an ELT (Extract - Load - Transform) data pipeline with the goodreads dataset, using dagster (orchestration), spar… ☆44 · Apr 22, 2023 · Updated 3 years ago