abdkumar / spotify-stream-analyticsLinks
Generate synthetic Spotify music stream dataset to create dashboards. Spotify API generates fake event data emitted to Kafka. Spark consumes and processes Kafka data, saving it to the Datalake. Airflow orchestrates the pipeline. dbt moves data to Snowflake, transforms it, and creates dashboards.
☆71Updated 2 years ago
Alternatives and similar repositories for spotify-stream-analytics
Users that are interested in spotify-stream-analytics are comparing it to the libraries listed below
Sorting:
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more☆373Updated 2 years ago
- ☆163Updated 3 years ago
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆245Updated 3 years ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆284Updated last year
- ☆148Updated 3 years ago
- ☆383Updated last year
- Code for "Efficient Data Processing in Spark" Course☆360Updated 3 months ago
- Sample project to demonstrate data engineering best practices☆202Updated last year
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews