In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data from the Spotify API, transform into desired format and load it into an AWS data store.
☆25May 6, 2023Updated 2 years ago
Alternatives and similar repositories for spotify-data-engineering-project
Users that are interested in spotify-data-engineering-project are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This project involves an ETL (Extract, Transform, Load) process to analyze sleep data exported from Apple Health☆29Apr 29, 2023Updated 2 years ago
- ☆19May 27, 2023Updated 2 years ago
- ☆21Jan 13, 2024Updated 2 years ago
- ☆171May 20, 2022Updated 3 years ago
- Base Kafka Producer, consumer, flask api and PySpark Structured streaming Job☆11Oct 20, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Integrating with Spotify API and extracting Data. Deploying code on AWS Lambda for Data Extraction. Adding trigger to run the extraction …☆12Jul 5, 2023Updated 2 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆11May 7, 2023Updated 2 years ago
- 65 Articles on SQL: A Comprehensive Guide to Mastering Advanced SQL☆11Jun 7, 2023Updated 2 years ago
- ☆40Jul 11, 2023Updated 2 years ago
- YouTube tutorial project☆108Oct 17, 2023Updated 2 years ago
- an end-to-end data pipeline extracting music listening habits and producing an insightful dashboard☆17Mar 31, 2024Updated 2 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆217Dec 31, 2025Updated 3 months ago
- A batch Data Pipeline that retrieves data from a user purchase table and a movie review table and is transformed to form a user behaviour…☆18Aug 14, 2025Updated 8 months ago
- ☆16Apr 1, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Data pipeline that scrapes Rust cheater Steam profiles☆54Feb 13, 2022Updated 4 years ago
- Practice your pandas skills!☆11Dec 10, 2019Updated 6 years ago
- This repository contains my solutions to the top 50 LeetCode SQL challenges implemented using PySpark DataFrame and PySpark SQL.☆29Mar 16, 2024Updated 2 years ago
- ☆20Mar 9, 2023Updated 3 years ago
- ☆16Jul 15, 2023Updated 2 years ago
- Data Engineering Project in GCP☆22Mar 29, 2023Updated 3 years ago
- PySpark Tutorials and Materials☆19Mar 1, 2021Updated 5 years ago
- Airflow POC demo : 1) env set up 2) airflow DAG 3) Spark/ML pipeline | #DE☆11Dec 19, 2022Updated 3 years ago
- Codewars solutions in Python☆39Jun 15, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- DataTalks.Club's Data Engineering Zoomcamp Project☆24Jul 14, 2022Updated 3 years ago
- This repo contains all the code used in the Python for Data Engineering Course☆359Apr 24, 2024Updated last year
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆267Jan 1, 2023Updated 3 years ago
- ☆146Jan 31, 2023Updated 3 years ago
- A project portfolio to accompany my resume☆30Sep 5, 2023Updated 2 years ago
- ☆23Nov 8, 2022Updated 3 years ago
- ☆16May 29, 2023Updated 2 years ago
- ☆214Aug 13, 2023Updated 2 years ago
- ☆15Oct 19, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆10Jun 3, 2017Updated 8 years ago
- RedditR for Content Engagement and Recommendation☆18Dec 21, 2017Updated 8 years ago
- Sample Faust project to process tweets in real-time☆13Mar 29, 2021Updated 5 years ago
- Repository containing projects and summaries of my studies in the field of Data Engineering.☆54Jan 28, 2026Updated 2 months ago
- Analyzing the most strategic words to guess on Wordle, based on letter frequency distributions☆11Feb 20, 2022Updated 4 years ago
- Pyspark Spotify ETL☆17Aug 19, 2021Updated 4 years ago
- Ecommerce Realtime Data Pipeline (Data Modeling, Workflow Orchestration, Change Data Capture, Analytical Database and Dashboarding)☆66Mar 9, 2024Updated 2 years ago