Sample repo for startdataengineering DE 101 free course
☆74Jun 24, 2024Updated last year
Alternatives and similar repositories for sde_de101_josephmachado
Users that are interested in sde_de101_josephmachado are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆104Jun 7, 2024Updated last year
- Code for "Advanced data transformations in SQL" free live workshop☆92May 5, 2025Updated last year
- Cost Efficient Data Pipelines with DuckDB☆63May 14, 2025Updated 11 months ago
- Step by step instructions to create a production-ready data pipeline☆60Dec 23, 2024Updated last year
- Beginner data engineering project - batch edition☆581Apr 13, 2026Updated 3 weeks ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code for DE101 book at https://de101.startdataengineering.com/☆100Feb 22, 2026Updated 2 months ago
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow☆34Oct 18, 2020Updated 5 years ago
- Code for data quality with greatexpectations blog☆13Jul 30, 2024Updated last year
- Code for dbt tutorial☆174Sep 9, 2025Updated 8 months ago
- ☆14Dec 11, 2023Updated 2 years ago
- ☆16Apr 26, 2024Updated 2 years ago
- Sample project to demonstrate data engineering best practices☆217Feb 24, 2024Updated 2 years ago
- Une liste de projets data professionnels pour enrichir ton portfolio☆53Apr 11, 2026Updated 3 weeks ago
- Process manager and website for hosting multiple Streamlit apps☆13Jun 28, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆292Jul 11, 2024Updated last year
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆40Apr 29, 2024Updated 2 years ago
- Code for "Efficient Data Processing in Spark" Course☆376Oct 16, 2025Updated 6 months ago
- A course in data warehouse☆19Sep 27, 2025Updated 7 months ago
- ☆10May 3, 2025Updated last year
- My notes of the Data Engineering Zoomcamp by DataTalksClub☆38Apr 16, 2023Updated 3 years ago
- Deploy a complete data stack in just a couple of minutes.☆15Mar 6, 2024Updated 2 years ago
- Repository for the D ONE MLOps AWS BlogPost☆11Updated this week
- Example repo to create end to end tests for data pipeline.☆25Jun 14, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Data Agents are intelligent assistants built by data engineers to help non-data professionals navigate the organization’s data infrastruc…☆21Apr 14, 2025Updated last year
- Data Engineering Project to Extract and Process Solana Reddit Data☆39Feb 3, 2024Updated 2 years ago
- ☆10Apr 8, 2024Updated 2 years ago
- ☆18Mar 7, 2025Updated last year
- Google Cloud Dataflow Examples☆13May 19, 2016Updated 9 years ago
- This project implements an ELT (Extract - Load - Transform) data pipeline with the goodreads dataset, using dagster (orchestration), spar…☆43Apr 22, 2023Updated 3 years ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆125Mar 31, 2025Updated last year
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆44Jan 4, 2024Updated 2 years ago
- Creates csv files containing football data scraped from the website www.fbref.com☆23Jul 4, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Course notes for the Astronomer Certification DAG Authoring for Apache Airflow☆56Mar 21, 2024Updated 2 years ago
- Primary repository for NYC DCP's Data Engineering team☆39May 1, 2026Updated last week
- Code samples for Ingest data with Microsoft Fabric notebooks☆10Jul 21, 2023Updated 2 years ago
- ☆12Mar 6, 2021Updated 5 years ago
- This repository goes over how to handle massive variety in data engineering☆320Jan 16, 2023Updated 3 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, duckdb and Superset☆49Apr 5, 2026Updated last month
- ☆27Jan 28, 2025Updated last year