In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data from the Spotify API, transform into desired format and load it into an AWS data store.
☆25May 6, 2023Updated 3 years ago
Alternatives and similar repositories for spotify-data-engineering-project
Users that are interested in spotify-data-engineering-project are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This project involves an ETL (Extract, Transform, Load) process to analyze sleep data exported from Apple Health☆29Apr 29, 2023Updated 3 years ago
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆26Nov 8, 2022Updated 3 years ago
- ☆19May 27, 2023Updated 3 years ago
- I am using confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆29May 2, 2023Updated 3 years ago
- ☆21Jan 13, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Contains spark dataframe solutions of leetcode questions☆24Dec 13, 2022Updated 3 years ago
- Base Kafka Producer, consumer, flask api and PySpark Structured streaming Job☆11Oct 20, 2021Updated 4 years ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆11May 7, 2023Updated 3 years ago
- ☆40Jul 11, 2023Updated 2 years ago
- YouTube tutorial project☆108Oct 17, 2023Updated 2 years ago
- an end-to-end data pipeline extracting music listening habits and producing an insightful dashboard☆18Mar 31, 2024Updated 2 years ago
- Data from the state of data science survey released by Anaconda each year.☆17Aug 15, 2024Updated last year
- Spotify data pipeline: Extract, transform, and analyze using AWS, Lambda, Glue, Athena, and S3.☆13Jun 27, 2023Updated 2 years ago
- A batch Data Pipeline that retrieves data from a user purchase table and a movie review table and is transformed to form a user behaviour…☆18Aug 14, 2025Updated 9 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆17Apr 1, 2025Updated last year
- Data pipeline that scrapes Rust cheater Steam profiles☆54Feb 13, 2022Updated 4 years ago
- Practice your pandas skills!☆11Dec 10, 2019Updated 6 years ago
- This repository contains my solutions to the top 50 LeetCode SQL challenges implemented using PySpark DataFrame and PySpark SQL.☆31Mar 16, 2024Updated 2 years ago
- ☆16Jul 15, 2023Updated 2 years ago
- Data Engineering Project in GCP☆22Mar 29, 2023Updated 3 years ago
- PySpark Tutorials and Materials☆19Mar 1, 2021Updated 5 years ago
- Python - Complete Python, Django, Data Science and ML Guide, published by Packt☆15Dec 15, 2025Updated 5 months ago
- Codewars solutions in Python☆39Jun 15, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- DataTalks.Club's Data Engineering Zoomcamp Project☆24Jul 14, 2022Updated 3 years ago
- This repo contains all the code used in the Python for Data Engineering Course☆363Apr 24, 2024Updated 2 years ago
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆267Jan 1, 2023Updated 3 years ago
- A project portfolio to accompany my resume☆30Sep 5, 2023Updated 2 years ago
- ☆23Nov 8, 2022Updated 3 years ago
- ☆16May 29, 2023Updated 3 years ago
- ☆215Aug 13, 2023Updated 2 years ago
- ☆15Oct 19, 2023Updated 2 years ago
- A real-time reddit data streaming pipeline for sentiment analysis of various subreddits☆146Aug 23, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- tutorial-vending-machine-marcgibbons created by GitHub Classroom☆10May 22, 2019Updated 7 years ago
- ☆10Jun 3, 2017Updated 8 years ago
- RedditR for Content Engagement and Recommendation☆18Dec 21, 2017Updated 8 years ago
- Sample Faust project to process tweets in real-time☆13Mar 29, 2021Updated 5 years ago
- Repository containing projects and summaries of my studies in the field of Data Engineering.☆55May 22, 2026Updated last week
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆22Feb 3, 2026Updated 3 months ago
- Pyspark Spotify ETL☆17Aug 19, 2021Updated 4 years ago