Generates a synthetic Spotify music-stream dataset for building dashboards. The Spotify API generates fake event data that is emitted to Kafka. Spark consumes and processes the Kafka data and saves it to the data lake. Airflow orchestrates the pipeline. dbt moves the data to Snowflake, transforms it, and creates dashboards.
☆72 Dec 17, 2023 Updated 2 years ago
Alternatives and similar repositories for spotify-stream-analytics
Users that are interested in spotify-stream-analytics are comparing it to the libraries listed below.
- ☆12 May 27, 2024 Updated last year
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke… ☆20 Aug 12, 2025 Updated 9 months ago
- Simple project using PyFlink, Kafka, and Postgres, containerized using Docker ☆11 Aug 26, 2024 Updated last year
- ⚙️ Airflow data pipeline with Terraform, GCP BigQuery, dbt, Soda and Looker Studio. ☆25 Oct 19, 2023 Updated 2 years ago
- Streaming analytics project with eventsim and Kafka ☆13 Dec 23, 2022 Updated 3 years ago
- This repository contains the capstone project carried out as part of Machine Learning Zoomcamp course ☆10 Dec 26, 2022 Updated 3 years ago
- Public data and analytics for our open course ☆34 Mar 22, 2024 Updated 2 years ago
- This Power BI project provides insights into customer orders and product tracking using interactive dashboards. It visualizes order statu… ☆10 Aug 15, 2025 Updated 9 months ago
- This repository contains notebooks, homework, projects and notes done during Machine Learning Zoomcamp course. ☆13 Nov 13, 2024 Updated last year
- Test driven learning of Cascading. ☆39 Feb 11, 2020 Updated 6 years ago
- Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision Language Models ☆21 Oct 12, 2025 Updated 7 months ago
- Local development environment for python data projects, with Docker ☆23 Dec 14, 2022 Updated 3 years ago
- ☆11 Aug 10, 2023 Updated 2 years ago
- ☆17 Apr 19, 2024 Updated 2 years ago
- ☆12 Sep 9, 2023 Updated 2 years ago
- A custom end-to-end analytics platform for customer churn ☆10 May 15, 2025 Updated last year
- This is a capstone project associated with MLOps Zoomcamp. The end goal of the project is to build an end-to-end machine learning projec… ☆13 Sep 8, 2022 Updated 3 years ago
- End-to-end data platform leveraging the Modern data stack ☆52 Apr 10, 2024 Updated 2 years ago
- A series on learning Apache Spark (PySpark), with quick tips and workarounds for everyday problems ☆55 Sep 30, 2023 Updated 2 years ago
- Here I will be exploring various tools and methods that are used in data engineering process with Python. ☆21 Jan 4, 2021 Updated 5 years ago
- Data Science Intern at Data Glacier ☆12 Jun 30, 2022 Updated 3 years ago
- 🤖 An autonomous AI agent system that collaboratively designs, implements, and manages Apache Airflow DAGs through natural language inter… ☆28 Aug 6, 2025 Updated 9 months ago
- Candace's Data Engineering Zoomcamp files and notes ☆18 Jul 4, 2023 Updated 2 years ago
- This repo consists of all important concepts for data engineers. ☆11 Dec 24, 2024 Updated last year
- A project management CLI written purely in SQL. ☆17 Dec 31, 2024 Updated last year
- ☆14 Aug 28, 2024 Updated last year
- SQL scripts that demonstrate various features and concepts. ☆14 Updated this week
- Sentiment analysis on Naver Shopping review data (GRU, LSTM) ☆10 Sep 27, 2021 Updated 4 years ago
- An AWS Data Engineering End-to-End Project (Glue, Lambda, Kinesis, Redshift, QuickSight, Athena, EC2, S3) ☆16 Sep 20, 2023 Updated 2 years ago
- This project demonstrates how to build and automate an ETL pipeline written in Python and schedule it using open source Apache Airflow or… ☆23 Aug 21, 2025 Updated 8 months ago
- Data Augmentation with Python, published by Packt ☆37 Oct 28, 2024 Updated last year
- For understanding and learning a bit about Apache Kafka. https://www.youtube.com/channel/UC3pevgVzUWKo5CoWdhDsoHw ☆13 Mar 10, 2026 Updated 2 months ago
- 📦 Starting box for Vagrant. Inside box Ubuntu 20.04 LTS with Git, Docker and Docker compose. ☆19 May 5, 2022 Updated 4 years ago
- The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technolog… ☆13 Jun 26, 2022 Updated 3 years ago
- Data Engineer Project: An end-to-end Airflow data pipeline with BigQuery, dbt, Soda, and more! ☆12 Dec 14, 2023 Updated 2 years ago
- GitHub Actions to Validate DAGs, Variables and Dependencies upon Pull Request ☆23 Mar 5, 2026 Updated 2 months ago
- ☆15 Oct 19, 2023 Updated 2 years ago
- ☆15 Mar 15, 2024 Updated 2 years ago
- Analyzing the most strategic words to guess on Wordle, based on letter frequency distributions ☆11 Feb 20, 2022 Updated 4 years ago