A demonstration of an ELT (Extract, Load, Transform) pipeline
☆31Feb 19, 2024Updated 2 years ago
Alternatives and similar repositories for data-pipeline-demo
Users that are interested in data-pipeline-demo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Orchestrate Modal and OpenAI workloads with Dagster☆13Dec 11, 2024Updated last year
- Apache Airflow advanced functionalities examples☆21Mar 22, 2024Updated 2 years ago
- ETL to scrape a real estate website, process house prices and data, and build an ML model of the house prices.☆16Jul 11, 2022Updated 3 years ago
- This is the repo of the Weather app from my YouTube video☆19Jul 6, 2023Updated 2 years ago
- FInal project for data zoom camp 2024☆17Mar 31, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Aug 21, 2023Updated 2 years ago
- ☆16Nov 27, 2025Updated 4 months ago
- This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de…☆11Nov 18, 2023Updated 2 years ago
- End-to-end data pipeline that ingests, processes, and stores data. It uses Apache Airflow to schedule scripts that fetch data from an API…☆21Jul 26, 2024Updated last year
- A simple Data Engineering solution for testing or education purposes. You only need to know SQL and Python to understand this project. Da…☆28Jul 2, 2022Updated 3 years ago
- Advanced Vehicle Tracking and Detection System using ByteTrack, Supervision, and YOLO Algorithms☆10May 10, 2023Updated 2 years ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆47Dec 11, 2023Updated 2 years ago
- This project is about building a dimensional data warehouse in BigQuery by transforming an OLTP system to an OLAP system, using dbt as ou…☆13Dec 11, 2023Updated 2 years ago
- With everything I learned from DEZoomcamp from datatalks.club, this project performs a batch processing on AWS for the cycling dataset wh…☆15Jan 4, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆16Oct 2, 2024Updated last year
- Functional Data Engineering tutorial in Python & Airflow.☆17Mar 24, 2023Updated 3 years ago
- Data Engineering Bootcamp☆31Aug 5, 2025Updated 8 months ago
- Practical Data Engineering: A Hands-On Real-Estate Project Guide☆799Mar 10, 2026Updated last month
- ☆214Aug 13, 2023Updated 2 years ago
- Building a machine learning model to classify failures☆13Mar 20, 2024Updated 2 years ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆40Apr 29, 2024Updated last year
- A batch Data Pipeline that retrieves data from a user purchase table and a movie review table and is transformed to form a user behaviour…☆18Aug 14, 2025Updated 8 months ago
- Code for my "Efficient Data Processing in SQL" book.☆61Aug 6, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆30Nov 16, 2023Updated 2 years ago
- data analytics and data engineering consulting hand book☆38Nov 28, 2023Updated 2 years ago
- Source code of webpro.nl☆11Oct 12, 2025Updated 6 months ago
- ☆21Mar 31, 2024Updated 2 years ago
- Cutting-edge, opinionated, and ambitious project builder for power users and researchers.☆16Feb 2, 2026Updated 2 months ago
- Final Project for Data Engineering Zoomcamp Course 2024 🧙🔥☆11Apr 17, 2024Updated 2 years ago
- ☆10May 24, 2021Updated 4 years ago
- A testing ground for Plotly Dash app development including app features and experimenting with dashboard visualizations.☆10Oct 15, 2023Updated 2 years ago
- Parsing Module of Microsoft SQL Server Transaction log☆11May 12, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Repository for Data Engineering Interview Series☆37Oct 17, 2024Updated last year
- DataTalks.Club's Data Engineering Zoomcamp Project☆24Jul 14, 2022Updated 3 years ago
- ☆10Jul 19, 2020Updated 5 years ago
- Firefox extension that shows parquet schema when going over GCP cloud storage. Use DuckDB WASM☆12Jan 19, 2024Updated 2 years ago
- reating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆15Jun 26, 2023Updated 2 years ago
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging…☆100Aug 11, 2019Updated 6 years ago
- Slow & local data allows you to move fast and deliver business value for the 99.9% of the data challenges.☆379Sep 30, 2025Updated 6 months ago