This project demonstrates how to build and automate an ETL pipeline written in Python and schedule it using the open-source Apache Airflow orchestration tool on an AWS EC2 instance.
☆23Aug 21, 2025Updated 9 months ago
Alternatives and similar repositories for Python-ETL-pipeline-using-Airflow-on-AWS
Users that are interested in Python-ETL-pipeline-using-Airflow-on-AWS are comparing it to the libraries listed below.
- Integrating with the Spotify API and extracting data. Deploying code on AWS Lambda for data extraction. Adding a trigger to run the extraction …☆12Jul 5, 2023Updated 2 years ago
- Udacity Data Engineering Nano Degree Project, Data Modeling for fact and dimension tables, and ETL pipeline that transfers data from file…☆10Dec 12, 2020Updated 5 years ago
- ☆19Nov 27, 2023Updated 2 years ago
- ☆13Jun 15, 2023Updated 2 years ago
- This project demonstrates how to build and automate an ETL pipeline using DAGs in Airflow and load the transformed data to BigQuery. Ther…☆27Aug 22, 2025Updated 9 months ago
- Spotify ETL Pipeline☆13Oct 13, 2023Updated 2 years ago
- Simple project using PyFlink, Kafka, and PostgreSQL, containerized using Docker☆11Aug 26, 2024Updated last year
- ☆31Jun 12, 2023Updated 2 years ago
- Local SQL Database ---> Azure ---> Power BI☆15Oct 13, 2023Updated 2 years ago
- ☆13Jun 9, 2022Updated 3 years ago
- ☆11Oct 8, 2021Updated 4 years ago
- Source code related to the articles posted on medium.com☆12Nov 2, 2020Updated 5 years ago
- This is an analysis to optimise inventory management for the FitCapacity company by analyzing sales and inventory data using SQL and Power BI☆28Sep 6, 2023Updated 2 years ago
- Large language models to diffusion finetuning code☆26Jun 2, 2025Updated 11 months ago
- ☆21Oct 21, 2024Updated last year
- ☆14Sep 22, 2022Updated 3 years ago
- Tool which summarizes daily and total gas consumption of all transactions sent from a specified Ethereum address.☆15Jun 28, 2023Updated 2 years ago
- This is a demo repository for parallel multi-index question answering using Streamlit and LlamaIndex☆24Aug 31, 2023Updated 2 years ago
- querycrafter repo☆23Feb 18, 2026Updated 3 months ago
- Collaborative notes on development tools and knowledge management☆25Jun 9, 2016Updated 9 years ago
- This repo contains my projects from the Udacity Data Engineering Nano degree☆13Apr 26, 2023Updated 3 years ago
- Design Patterns for Humans™ - Ultra-simplified explanation (Traditional Chinese edition)☆26Mar 15, 2017Updated 9 years ago
- ☆13Nov 4, 2021Updated 4 years ago
- ☆13Dec 18, 2024Updated last year
- code snippet for analytics sessions☆34May 17, 2022Updated 4 years ago
- Azure OpenAI integration as a custom skillset in Azure Cognitive Search☆35Mar 28, 2023Updated 3 years ago
- ☆21Mar 31, 2024Updated 2 years ago
- Machine Learning Data Pipelines Presentation☆17Nov 2, 2023Updated 2 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆169Dec 8, 2022Updated 3 years ago
- This project aims to leverage Amazon Web Services to create a trending YouTube videos analytics service. Project contains different data en…☆22Aug 16, 2025Updated 9 months ago
- A simple API to retrieve some quotes of Lucifer, shawty!☆12Oct 29, 2025Updated 6 months ago
- A collection of AWS templates for deploying AI Agents☆21Oct 17, 2024Updated last year
- See the kind of artist you are on GitHub☆24Sep 6, 2021Updated 4 years ago
- Test Driven Development Class For O'Reilly Publishing☆16Jul 6, 2018Updated 7 years ago
- Udacity's 5 Month Data Engineering Nanodegree program. This repo includes all the projects completed.☆27May 31, 2020Updated 5 years ago
- n8n node to interact with Firecrawl☆47Updated this week
- This project introduces PySpark, a powerful open-source framework for distributed data processing. We explore its architecture, component…☆46Sep 26, 2024Updated last year
- ☆23Jun 2, 2021Updated 4 years ago
- Providing an easy way to deploy a Glue job in any AWS account using Terraform☆25Aug 14, 2024Updated last year