Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow
☆167Jun 16, 2020Updated 5 years ago
Alternatives and similar repositories for Movalytics-Data-Warehouse
Users that are interested in Movalytics-Data-Warehouse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Personal Data Engineering Projects☆1,017Feb 8, 2023Updated 3 years ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆51Aug 23, 2019Updated 6 years ago
- Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake developme…☆1,920Aug 26, 2022Updated 3 years ago
- Educational project on how to build an ETL (Extract, Transform, Load) data pipeline, orchestrated with Airflow.☆349Jan 12, 2022Updated 4 years ago
- An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.☆1,513Mar 9, 2020Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for …☆139Apr 18, 2020Updated 6 years ago
- Loan Default Prediction using PySpark, with jobs scheduled by Apache Airflow and Integration with Spark using Apache Livy☆22Dec 26, 2020Updated 5 years ago
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆24May 14, 2022Updated 4 years ago
- My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on lambda architecture, that aggrega…☆506Aug 24, 2022Updated 3 years ago
- Example end to end data engineering project.☆1,412Dec 8, 2022Updated 3 years ago
- Execution of DBT models using Apache Airflow through Docker Compose☆132Jan 3, 2023Updated 3 years ago
- The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technolog…☆13Jun 26, 2022Updated 3 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆56May 6, 2023Updated 3 years ago
- Udacity Data Engineering Nano Degree (DEND)☆188Jan 20, 2020Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Apache Spark Guide☆38Feb 1, 2022Updated 4 years ago
- ☆20Jan 23, 2023Updated 3 years ago
- Example project for consuming AWS Kinesis streamming and save data on Amazon Redshift using Apache Spark☆11May 22, 2018Updated 8 years ago
- Near real time ETL to populate a dashboard.☆75Sep 9, 2025Updated 9 months ago
- Beginner data engineering project - batch edition☆581Apr 13, 2026Updated 2 months ago
- Projects done in the Data Engineering Nanodegree by Udacity.com☆275Mar 1, 2026Updated 3 months ago
- Classwork projects and home works done through Udacity data engineering nano degree☆77Dec 12, 2023Updated 2 years ago
- ☆16May 29, 2023Updated 3 years ago
- Simple ETL pipeline using Python☆29May 22, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆26Nov 8, 2022Updated 3 years ago
- Apache Airflow advanced functionalities examples☆21Mar 22, 2024Updated 2 years ago
- This project helps me to understand the core concepts of Apache Airflow. I have created custom operators to perform tasks such as staging…☆104Aug 11, 2019Updated 6 years ago
- A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS).☆15Jun 3, 2021Updated 5 years ago
- This is a web app built for easy explainability of machine learning models without writing any code in order to explain easily to non-te…☆20Jan 10, 2024Updated 2 years ago
- A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more!☆880Apr 16, 2022Updated 4 years ago
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…☆15Apr 29, 2021Updated 5 years ago
- This is a template you can use for your next data engineering portfolio project.☆191Sep 10, 2021Updated 4 years ago
- Finance 🏦 Data Builder 🛠️ @ postgres 🐘☆22Feb 11, 2021Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- An end-to-end ETL pipeline that extracts weather data, transforms it, and loads it into a PostgreSQL database.☆14Sep 6, 2024Updated last year
- ☆16Dec 23, 2023Updated 2 years ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆22Jul 9, 2019Updated 6 years ago
- End-to-End BI & DW project: Data Warehousing design and modeling (MySQL), ETL (PDI) and Dashboard (Tableau)☆17Aug 10, 2020Updated 5 years ago
- Apache Spark using SQL☆14Aug 18, 2021Updated 4 years ago
- ☆19May 27, 2023Updated 3 years ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR☆92Jul 17, 2019Updated 6 years ago