End-to-end ELT data engineering project
☆23 · Dec 24, 2022 · Updated 3 years ago
Alternatives and similar repositories for End-to-end-data-enginnerring-project
Users that are interested in End-to-end-data-enginnerring-project are comparing it to the libraries listed below.
- Building a data warehouse on BigQuery that takes flat files as the data source, with Airflow as the orchestrator ☆13 · May 23, 2021 · Updated 5 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project ☆24 · Jul 14, 2022 · Updated 3 years ago
- In this project I have built an ETL pipeline which scrapes the trending repositories based on month, week and day LIVE extract other related info… ☆12 · Sep 9, 2023 · Updated 2 years ago
- NoSQL extract, transform, load (ETL) toolkit with Python ☆16 · May 9, 2026 · Updated 2 weeks ago
- A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, Dbt, Polars, Doc… ☆23 · Nov 19, 2024 · Updated last year
- Scan and monitor your network effortlessly! Nmap Prometheus Exporter provides insights into network health and security with Prometheus-c… ☆15 · Oct 2, 2023 · Updated 2 years ago
- An end-to-end data pipeline extracting music listening habits and producing an insightful dashboard ☆18 · Mar 31, 2024 · Updated 2 years ago
- Portfolio of projects and studies conducted in data engineering. ☆34 · Feb 22, 2025 · Updated last year
- Spark data pipeline that processes movie ratings data. ☆31 · May 1, 2026 · Updated 3 weeks ago
- A simple tool for monitoring the progress of OpenFOAM simulations ☆13 · Nov 9, 2018 · Updated 7 years ago
- Matlab toolbox for generating block structured hex meshes in the polyMesh file format of OpenFOAM. ☆13 · Jan 2, 2013 · Updated 13 years ago
- StarCraft 2 Data Pipeline with Airflow, DuckDB and Streamlit ☆16 · Mar 14, 2024 · Updated 2 years ago
- ☆15 · Jan 26, 2023 · Updated 3 years ago
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin… ☆15 · Apr 29, 2021 · Updated 5 years ago
- Code for the Data Engineering Zoomcamp ☆20 · Dec 12, 2022 · Updated 3 years ago
- Python wrapper for OpenFOAM meshes ☆13 · Sep 16, 2025 · Updated 8 months ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke… ☆20 · Aug 12, 2025 · Updated 9 months ago
- Skooldio: Data Pipelines with Airflow ☆23 · May 24, 2025 · Updated last year
- Generate OpenAPI 3.x.x using Pydantic ☆11 · Feb 9, 2023 · Updated 3 years ago
- A project portfolio to accompany my resume ☆30 · Sep 5, 2023 · Updated 2 years ago
- ☆10 · Feb 12, 2026 · Updated 3 months ago
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on. ☆24 · Apr 27, 2023 · Updated 3 years ago
- Code for the table-based open domain question answering project, with paper title: "Reasoning over Hybrid Chain for Table-and-Text Open D… ☆12 · Sep 16, 2022 · Updated 3 years ago
- Code to build models that effectively predict promoter-driven gene expression ☆12 · May 15, 2025 · Updated last year
- postProcessing tool for OpenFOAM; transforms OpenFOAM fields into a single file by columns ☆18 · May 11, 2021 · Updated 5 years ago
- Fully dockerized Data Warehouse (DWH) using Airflow, dbt, PostgreSQL and dashboard using redash ☆26 · Nov 12, 2022 · Updated 3 years ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce ☆15 · May 22, 2023 · Updated 3 years ago
- Haraka SMTP plugin for logging outbound traffic. Useful for storing audit information of delivered/bounced emails. ☆16 · Jan 12, 2023 · Updated 3 years ago
- Data and code for ACL 2023 paper "RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations" ☆15 · Feb 8, 2024 · Updated 2 years ago
- Google Ad Manager API Client Library for NodeJs. ☆12 · Jul 2, 2023 · Updated 2 years ago
- Coupon System project: SpringBoot & AngularTS ☆11 · Jan 3, 2021 · Updated 5 years ago
- Example using Great Expectations to Validate Data in a scikit-learn Pipeline ☆21 · Jul 23, 2020 · Updated 5 years ago
- ☆25 · Dec 18, 2020 · Updated 5 years ago
- Creation of a Fantasy Premier League data pipeline for analysis of both team & player performance. Technologies include dbt, Prefect, Te… ☆11 · Apr 13, 2023 · Updated 3 years ago
- ☆12 · Jul 10, 2023 · Updated 2 years ago
- This repo contains my projects from the Udacity Data Engineering Nanodegree ☆13 · Apr 26, 2023 · Updated 3 years ago
- A self-contained, ready to run Airflow and Kafka project. Can be run locally or within codespaces. ☆16 · Jul 15, 2023 · Updated 2 years ago
- Modern Data Engineering Project ☆12 · Jun 3, 2022 · Updated 3 years ago
- OpenFOAM-7 third-party library compilation scripts ☆11 · Jun 9, 2020 · Updated 5 years ago