Simple stream processing pipeline
☆113 · Jun 17, 2024 · Updated last year
Alternatives and similar repositories for beginner_de_project_stream
Users interested in beginner_de_project_stream are comparing it to the repositories listed below.
- Beginner data engineering project - batch edition ☆581 · Apr 13, 2026 · Updated 2 months ago
- Simple repo to demonstrate how to submit a spark job to EMR from Airflow ☆34 · Oct 18, 2020 · Updated 5 years ago
- Near real time ETL to populate a dashboard. ☆75 · Sep 9, 2025 · Updated 9 months ago
- Cost Efficient Data Pipelines with DuckDB ☆61 · May 14, 2025 · Updated last year
- Repo for CDC with debezium blog post ☆29 · Sep 15, 2024 · Updated last year
- A custom end-to-end analytics platform for customer churn ☆10 · May 15, 2025 · Updated last year
- Code for dbt tutorial ☆179 · Jun 4, 2026 · Updated last week
- Repository for Data Engineering Interview Series ☆39 · Oct 17, 2024 · Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/ ☆107 · May 26, 2026 · Updated 2 weeks ago
- Step by step instructions to create a production-ready data pipeline ☆61 · Dec 23, 2024 · Updated last year
- Data pipeline to build a data warehouse on Postgres ☆15 · Aug 11, 2024 · Updated last year
- rust-for-data ☆53 · Jul 12, 2023 · Updated 2 years ago
- Code for data quality with greatexpectations blog ☆13 · Jul 30, 2024 · Updated last year
- Sample project to demonstrate data engineering best practices ☆220 · Feb 24, 2024 · Updated 2 years ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing ☆291 · Jul 11, 2024 · Updated last year
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/ ☆13 · May 24, 2024 · Updated 2 years ago
- ☆14 · Dec 11, 2023 · Updated 2 years ago
- ☆16 · Apr 26, 2024 · Updated 2 years ago
- ☆14 · Oct 1, 2022 · Updated 3 years ago
- Simple ETL pipeline using Python ☆29 · May 22, 2023 · Updated 3 years ago
- ☆30 · Nov 16, 2023 · Updated 2 years ago
- Code/Notes for the Data Engineering Zoomcamp by DataTalksClub ☆32 · Mar 16, 2023 · Updated 3 years ago
- Rust And Delta Demo. Explanation and walkthrough on delta-rs ☆10 · Aug 21, 2023 · Updated 2 years ago
- Code for my "Efficient Data Processing in SQL" book. ☆62 · Aug 6, 2024 · Updated last year
- ☆11 · Nov 21, 2023 · Updated 2 years ago
- An end-to-end workflow for processing streaming data on Azure. ☆17 · Sep 20, 2024 · Updated last year
- Lecture Notes for DSML Jun22 Beginner's Intermediate module ☆11 · Oct 14, 2022 · Updated 3 years ago
- ☆17 · Apr 1, 2025 · Updated last year
- Data Engineering with Python, published by Packt ☆812 · Jan 30, 2023 · Updated 3 years ago
- Practical Data Engineering: A Hands-On Real-Estate Project Guide ☆803 · Mar 10, 2026 · Updated 3 months ago
- Example end to end data engineering project. ☆1,412 · Dec 8, 2022 · Updated 3 years ago
- A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more! ☆880 · Apr 16, 2022 · Updated 4 years ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/ ☆40 · Apr 29, 2024 · Updated 2 years ago
- Code for "Advanced data transformations in SQL" free live workshop ☆92 · May 5, 2025 · Updated last year
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more ☆435 · Nov 28, 2023 · Updated 2 years ago
- A demonstration of an ELT (Extract, Load, Transform) pipeline ☆31 · Feb 19, 2024 · Updated 2 years ago
- GitHub repository related to the course Mastering Elastic Map Reduce for Data Engineers ☆24 · Jul 31, 2022 · Updated 3 years ago
- Repo for Introduction to Iceberg Video ☆22 · Jun 3, 2024 · Updated 2 years ago
- MLOps.Community's reading group for Fundamentals of Data Engineering ☆11 · Aug 3, 2022 · Updated 3 years ago