In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data from the Spotify API, transform into desired format and load it into an AWS data store.
☆25May 6, 2023Updated 2 years ago
Alternatives and similar repositories for spotify-data-engineering-project
Users that are interested in spotify-data-engineering-project are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This project involves an ETL (Extract, Transform, Load) process to analyze sleep data exported from Apple Health☆29Apr 29, 2023Updated 2 years ago
- ☆19May 27, 2023Updated 2 years ago
- I am using confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆29May 2, 2023Updated 2 years ago
- ☆21Jan 13, 2024Updated 2 years ago
- Contains spark dataframe solutions of leetcode questions☆24Dec 13, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Base Kafka Producer, consumer, flask api and PySpark Structured streaming Job☆11Oct 20, 2021Updated 4 years ago
- Integrating with Spotify API and extracting Data. Deploying code on AWS Lambda for Data Extraction. Adding trigger to run the extraction …☆11Jul 5, 2023Updated 2 years ago
- A repo to track data engineering projects☆13Nov 11, 2022Updated 3 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆11May 7, 2023Updated 2 years ago
- 65 Articles on SQL: A Comprehensive Guide to Mastering Advanced SQL☆11Jun 7, 2023Updated 2 years ago
- ☆40Jul 11, 2023Updated 2 years ago
- Spotify data pipeline: Extract, transform, and analyze using AWS, Lambda, Glue, Athena, and S3.☆13Jun 27, 2023Updated 2 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆213Dec 31, 2025Updated 2 months ago
- A batch Data Pipeline that retrieves data from a user purchase table and a movie review table and is transformed to form a user behaviour…☆18Aug 14, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Data pipeline that scrapes Rust cheater Steam profiles☆54Feb 13, 2022Updated 4 years ago
- ☆16Jul 15, 2023Updated 2 years ago
- Data Engineering Project in GCP☆22Mar 29, 2023Updated 3 years ago
- PySpark Tutorials and Materials☆19Mar 1, 2021Updated 5 years ago
- Codewars solutions in Python☆39Jun 15, 2023Updated 2 years ago
- This repo contains all the code used in the Python for Data Engineering Course☆353Apr 24, 2024Updated last year
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆267Jan 1, 2023Updated 3 years ago
- A project portfolio to accompany my resume☆30Sep 5, 2023Updated 2 years ago
- ☆23Nov 8, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆16May 29, 2023Updated 2 years ago
- ☆213Aug 13, 2023Updated 2 years ago
- ☆15Oct 19, 2023Updated 2 years ago
- A real-time reddit data streaming pipeline for sentiment analysis of various subreddits☆144Aug 23, 2023Updated 2 years ago
- ☆10Jun 3, 2017Updated 8 years ago
- RedditR for Content Engagement and Recommendation☆18Dec 21, 2017Updated 8 years ago
- Repository containing projects and summaries of my studies in the field of Data Engineering.☆54Jan 28, 2026Updated 2 months ago
- Sample Faust project to process tweets in real-time☆13Mar 29, 2021Updated 5 years ago
- Pyspark Spotify ETL☆17Aug 19, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This repository implements a real-time credit card fraud detection pipeline using Kafka, Spark and Cassandra. Kafka continuously produces…☆23Feb 3, 2021Updated 5 years ago
- Data Structures in Python☆10Mar 16, 2026Updated last week
- Ecommerce Realtime Data Pipeline (Data Modeling, Workflow Orchestration, Change Data Capture, Analytical Database and Dashboarding)☆64Mar 9, 2024Updated 2 years ago
- In this project I used ML modeling and data analysis to predict ad clicks and significantly improve ad campaign performance, resulting in…☆12Nov 6, 2023Updated 2 years ago
- class code of Ai chatbot and voice app online course☆11Jul 22, 2025Updated 8 months ago
- Do you want to LEARN NEW STUFF for FREE? Don't worry, with the power of web-scraping and automation, this script will find the necessary …☆19Feb 29, 2024Updated 2 years ago
- CS231n Convolutional Neural Networks for Visual Recognition☆12Aug 17, 2021Updated 4 years ago