In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data from the Spotify API, transform into desired format and load it into an AWS data store.
☆25May 6, 2023Updated 2 years ago
Alternatives and similar repositories for spotify-data-engineering-project
Users that are interested in spotify-data-engineering-project are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This project involves an ETL (Extract, Transform, Load) process to analyze sleep data exported from Apple Health☆29Apr 29, 2023Updated 2 years ago
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆26Nov 8, 2022Updated 3 years ago
- ☆19May 27, 2023Updated 2 years ago
- I am using confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆29May 2, 2023Updated 2 years ago
- ☆21Jan 13, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Base Kafka Producer, consumer, flask api and PySpark Structured streaming Job☆11Oct 20, 2021Updated 4 years ago
- A repo to track data engineering projects☆13Nov 11, 2022Updated 3 years ago
- Integrating with Spotify API and extracting Data. Deploying code on AWS Lambda for Data Extraction. Adding trigger to run the extraction …☆12Jul 5, 2023Updated 2 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆11May 7, 2023Updated 2 years ago
- 65 Articles on SQL: A Comprehensive Guide to Mastering Advanced SQL☆11Jun 7, 2023Updated 2 years ago
- ☆40Jul 11, 2023Updated 2 years ago
- Data from the state of data science survey released by Anaconda each year.☆17Aug 15, 2024Updated last year
- Spotify data pipeline: Extract, transform, and analyze using AWS, Lambda, Glue, Athena, and S3.☆13Jun 27, 2023Updated 2 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆217Dec 31, 2025Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A batch Data Pipeline that retrieves data from a user purchase table and a movie review table and is transformed to form a user behaviour…☆18Aug 14, 2025Updated 8 months ago
- ☆16Apr 1, 2025Updated last year
- Practice your pandas skills!☆11Dec 10, 2019Updated 6 years ago
- This repository contains my solutions to the top 50 LeetCode SQL challenges implemented using PySpark DataFrame and PySpark SQL.☆29Mar 16, 2024Updated 2 years ago
- ☆16Jul 15, 2023Updated 2 years ago
- Data Engineering Project in GCP☆22Mar 29, 2023Updated 3 years ago
- Python - Complete Python, Django, Data Science and ML Guide, published by Packt☆15Dec 15, 2025Updated 4 months ago
- Codewars solutions in Python☆39Jun 15, 2023Updated 2 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆24Jul 14, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆267Jan 1, 2023Updated 3 years ago
- ☆146Jan 31, 2023Updated 3 years ago
- A project portfolio to accompany my resume☆30Sep 5, 2023Updated 2 years ago
- ☆23Nov 8, 2022Updated 3 years ago
- ☆16May 29, 2023Updated 2 years ago
- ☆214Aug 13, 2023Updated 2 years ago
- ☆15Oct 19, 2023Updated 2 years ago
- tutorial-vending-machine-marcgibbons created by GitHub Classroom☆10May 22, 2019Updated 6 years ago
- ☆10Jun 3, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- RedditR for Content Engagement and Recommendation☆18Dec 21, 2017Updated 8 years ago
- Sample Faust project to process tweets in real-time☆13Mar 29, 2021Updated 5 years ago
- Repository containing projects and summaries of my studies in the field of Data Engineering.☆54Jan 28, 2026Updated 2 months ago
- Analyzing the most strategic words to guess on Wordle, based on letter frequency distributions☆11Feb 20, 2022Updated 4 years ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆22Feb 3, 2026Updated 2 months ago
- Pyspark Spotify ETL☆17Aug 19, 2021Updated 4 years ago
- ☆14May 14, 2019Updated 6 years ago