Data Engineering examples for Airflow, Prefect; dbt for BigQuery, Redshift, ClickHouse, Postgres, DuckDB; PySpark for Batch processing; Kafka for Stream processing
☆74Mar 9, 2026Updated last month
Alternatives and similar repositories for data-engineering-labs
Users that are interested in data-engineering-labs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Streamlit-Powered DataTalksClub Project Analyzer: Interactive Insights at Your Fingertips☆20Apr 5, 2026Updated last week
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆15May 22, 2023Updated 2 years ago
- This repository contains the capstone project carried out as part of Machine Learning Zoomcamp course☆10Dec 26, 2022Updated 3 years ago
- Code for dbt tutorial☆172Sep 9, 2025Updated 7 months ago
- strategy backtesting framework☆12Oct 22, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆39Dec 18, 2023Updated 2 years ago
- Code for the Data Engineering Zoomcamp☆48May 7, 2023Updated 2 years ago
- Display live blood sugar data from Nightscout in your system menu bar.☆28Mar 30, 2026Updated last week
- A custom end-to-end analytics platform for customer churn☆11May 15, 2025Updated 10 months ago
- The "One Role to Rule them all" Ansible setup☆12Sep 20, 2019Updated 6 years ago
- ☆26May 25, 2022Updated 3 years ago
- Ansible setup of my development workstation☆13Sep 22, 2023Updated 2 years ago
- This project provides valuable customer sentiment insights for Zomato by tracking and analyzing tweets related to their brand and service…☆14Aug 27, 2023Updated 2 years ago
- a lightweight, semi-automated setup guide for HashiStack: Consul + Vault + Nomad, on Footloose powered Docker "container VMs", with Ansib…☆11Jul 2, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Analytics Engineer Course☆20May 17, 2023Updated 2 years ago
- This project aims to leverage Amazon Web Services to create trending Youtube videos analytics service. Project contains different data en…☆22Aug 16, 2025Updated 7 months ago
- ☆32May 30, 2023Updated 2 years ago
- Final Project of the MLOps Zoomcamp hosted by DataTalksClub.☆25Dec 19, 2022Updated 3 years ago
- reating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆15Jun 26, 2023Updated 2 years ago
- dbt integration for Cube☆16Oct 22, 2025Updated 5 months ago
- Detailed notes and homeworks from 2025 Data Engineering Zoomcamp by Datatalks.Club☆56Mar 10, 2025Updated last year
- My notes of the Data Engineering Zoomcamp by DataTalksClub☆38Apr 16, 2023Updated 2 years ago
- A POC framework to create Wordpress docker image with Ansible/Packer and deploy it to AWS ECS using Terraform☆17Jun 23, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Cloned by the `dbt init` task☆62Apr 28, 2024Updated last year
- Project was based on an interest in Data Engineering, ETL pipeline. It also provided a good opportunity to develop skills and experience…☆31Sep 12, 2023Updated 2 years ago
- A complete pipeline to pull data from Scryfall's "Magic: The Gathering"-API, via Prefect orchestration and dbt transformation.☆43Apr 27, 2023Updated 2 years ago
- Freepn tray controller for fpnd based on pygobject, app-indicator3 and gtk3☆16Dec 4, 2020Updated 5 years ago
- ☆14Feb 1, 2023Updated 3 years ago
- This repo contains "Azure Data Engineer Associate" Questions and related docs.☆13Jan 29, 2024Updated 2 years ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆65Jul 21, 2023Updated 2 years ago
- Appindicator for various cryptocoins written in python for GTK+3 systems☆17Mar 25, 2019Updated 7 years ago
- Airflow 3 demos from DevRel☆83Aug 1, 2025Updated 8 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- dbt module for myBI connect☆13Jan 31, 2023Updated 3 years ago
- Mastering Convolutional Neural Networks☆11Sep 14, 2020Updated 5 years ago
- Major refactor works ongoing☆14Jul 14, 2019Updated 6 years ago
- Code review for data in dbt☆495Jan 3, 2025Updated last year
- Use MobileNet SSD and openCV to detect and count car on road☆12Jan 13, 2020Updated 6 years ago
- paraphase sentence☆11Aug 22, 2025Updated 7 months ago
- Data pipeline for uploading, preprocessing, and visualising COVID19 data☆18Apr 1, 2023Updated 3 years ago