abdkumar/spotify-stream-analytics



Generates a synthetic Spotify music stream dataset for building dashboards. The Spotify API generates fake event data that is emitted to Kafka. Spark consumes and processes the Kafka data and saves it to the data lake. Airflow orchestrates the pipeline. dbt moves data to Snowflake, transforms it, and creates dashboards.
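
As a rough illustration of the first stage in that description (fake listening events destined for Kafka), the sketch below generates synthetic events and serializes each one to JSON bytes, the form a Kafka producer would typically send as a message value. It is not taken from the repository: the field names, the `make_event` and `event_stream` helpers, and the ID formats are all hypothetical, and no Kafka client is used so the example stays self-contained.

```python
import json
import random
from datetime import datetime, timedelta, timezone
from typing import Iterator

# Hypothetical catalogue sizes; the real project may model these differently.
NUM_USERS = 100
NUM_TRACKS = 500


def make_event(rng: random.Random, ts: datetime) -> dict:
    """Build one synthetic 'listen' event (field names are assumptions)."""
    return {
        "user_id": f"user_{rng.randrange(NUM_USERS):04d}",
        "track_id": f"track_{rng.randrange(NUM_TRACKS):05d}",
        "listened_ms": rng.randint(5_000, 300_000),
        "event_time": ts.isoformat(),
    }


def event_stream(n: int, seed: int = 42) -> Iterator[bytes]:
    """Yield n events as UTF-8 JSON bytes, one per simulated second."""
    rng = random.Random(seed)  # seeded so runs are reproducible
    start = datetime(2024, 1, 1, tzinfo=timezone.utc)
    for i in range(n):
        event = make_event(rng, start + timedelta(seconds=i))
        yield json.dumps(event).encode("utf-8")


if __name__ == "__main__":
    # Print a few payloads instead of sending them to a broker.
    for payload in event_stream(3):
        print(payload.decode("utf-8"))
```

In a real pipeline each payload would presumably be handed to a Kafka producer rather than printed; which client library and topic the project uses is not stated here.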

☆72

Alternatives and similar repositories for spotify-stream-analytics

Users that are interested in spotify-stream-analytics are comparing it to the libraries listed below.


kaoutaar / end-to-end-etl-pipeline-jcdecaux-API
velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…
☆21 • Aug 12, 2025 • Updated 11 months ago
augustodn / pyflink-docker
A simple project using PyFlink, Kafka, and PostgreSQL, containerized with Docker
☆11 • Aug 26, 2024 • Updated last year
ahmedembeddedxx / AskFAST
AskFAST is a chatbot designed to handle admission-related queries for FAST. It’s your go-to AI assistant for all things admission at FAST…
☆11 • Aug 5, 2024 • Updated last year
Hamagistral / OnlineRetail-DataEng
⚙️ Airflow data pipeline with Terraform, GCP BigQuery, dbt, Soda and Looker Studio.
☆26 • Oct 19, 2023 • Updated 2 years ago
SQLMCT / SQL_Performance_Tuning
☆15 • Apr 14, 2026 • Updated 3 months ago
bhasarma / kitchenware-classification-project
This repository contains the capstone project carried out as part of the Machine Learning Zoomcamp course
☆10 • Dec 26, 2022 • Updated 3 years ago
bhasarma / mlcoursezoom-camp
This repository contains notebooks, homework, projects, and notes from the Machine Learning Zoomcamp course.
☆13 • Nov 13, 2024 • Updated last year
Mariyajoseph24 / 8_Week_SQL_challenge
"#8WeekSQLChallenge: Solutions for a thrilling project that offers weekly SQL case studies! Engaged in real-world data analysis with int…
☆20 • Aug 30, 2023 • Updated 2 years ago
DataTalksClub / zoomcamp-analytics
Public data and analytics for our open course
☆34 • Mar 22, 2024 • Updated 2 years ago
josephmachado / local_dev
Local development environment for Python data projects, with Docker
☆23 • Dec 14, 2022 • Updated 3 years ago
Rajsingh92 / MUST_HAVE_SKILLS
This repo consists of all important concepts for data engineers.
☆11 • Jun 2, 2026 • Updated last month
raashidsalih / churn-pipeline
A custom end-to-end analytics platform for customer churn
☆10 • May 15, 2025 • Updated last year
jaumpedro214 / traffic-flow-spark-kafka
Testing Spark Structured Streaming and Kafka with real data from traffic sensors
☆17 • Nov 11, 2022 • Updated 3 years ago
jeantardelli / data-engineering-with-python
Here I will be exploring various tools and methods that are used in the data engineering process with Python.
☆21 • Jan 4, 2021 • Updated 5 years ago
subhamkharwal / ease-with-apache-spark
A series on learning Apache Spark (PySpark), with quick tips and workarounds for everyday problems
☆55 • Sep 30, 2023 • Updated 2 years ago
findmypast / recruitment-test-data-engineering
Code test for data engineering candidates
☆47 • Mar 27, 2024 • Updated 2 years ago
PacktPublishing / Data-Augmentation-with-Python
Data Augmentation with Python, published by Packt
☆37 • Oct 28, 2024 • Updated last year
teacherc / de_zoomcamp_candace2023
Candace's Data Engineering Zoomcamp files and notes
☆18 • Jul 4, 2023 • Updated 3 years ago
salmah52 / youtubeetl
☆15 • Oct 19, 2023 • Updated 2 years ago
threegenie / sentiment_project
Sentiment analysis on Naver Shopping review data (GRU, LSTM)
☆10 • Sep 27, 2021 • Updated 4 years ago
josephmachado / analytical_dp_with_sql
Code for my "Efficient Data Processing in SQL" book.
☆63 • Aug 6, 2024 • Updated last year
ognis1205 / delta-hub
A platform and cloud-based service for data sharing based on the Delta Sharing protocol.
☆21 • Jun 12, 2024 • Updated 2 years ago
edtk / vagrant-box
📦 Starter box for Vagrant. The box contains Ubuntu 20.04 LTS with Git, Docker, and Docker Compose.
☆19 • May 5, 2022 • Updated 4 years ago
siddharth271101 / Covid-19-and-Aviation-Industry
The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technolog…
☆13 • Jun 26, 2022 • Updated 4 years ago
Lal4Tech / Data-Engineering-With-AWS
Resources and projects from the Udacity Data Engineering with AWS Nanodegree programme
☆29 • Apr 12, 2023 • Updated 3 years ago
sergio-pestana / Data-Pipeline-using-Airflow-DBT-and-GCP
☆21 • Nov 4, 2023 • Updated 2 years ago
YFChiu / Resources--Databricks-Certified-Associate-Developer-for-Apache-Spark-3.0
(Python, PySpark)
☆11 • Nov 15, 2020 • Updated 5 years ago
danielbeach / tinytimmy
A simple and easy-to-use Data Quality (DQ) tool built with Python.
☆51 • Sep 7, 2023 • Updated 2 years ago
kisehyun / STUDY
Personal study
☆13 • Apr 1, 2023 • Updated 3 years ago
mordp1 / espetinhodekafka
To understand and learn a little about Apache Kafka. https://www.youtube.com/channel/UC3pevgVzUWKo5CoWdhDsoHw
☆15 • Updated this week
softwaredoug / hello-ltr
Set of Jupyter notebooks demonstrating Learning to Rank integrated with Solr and Elasticsearch
☆17 • Jun 19, 2022 • Updated 4 years ago
mrn-aglic / apache-iceberg-data-exploration
☆23 • Feb 5, 2024 • Updated 2 years ago
FL-Marine / 8-Week-SQL-Challenge
Case studies from Danny Ma's Serious SQL course
☆19 • Aug 4, 2022 • Updated 3 years ago
dlarionov / big-data-essentials
Coursera, Big Data Essentials: HDFS, MapReduce and Spark RDD
☆12 • Jun 18, 2019 • Updated 7 years ago
aws-samples / data-engineering-on-aws
☆22 • Oct 21, 2024 • Updated last year
priye-1 / airflow_data_pipeline
☆16 • May 29, 2023 • Updated 3 years ago
meetapandit / nyc-citibike-data-pipeline
☆12 • Jul 8, 2024 • Updated 2 years ago
KiranGunturu / lakehouse-formation
☆23 • Sep 25, 2024 • Updated last year
PacktPublishing / Apache-Spark-3-for-Data-Engineering-and-Analytics-with-Python-
Apache Spark 3 for Data Engineering and Analytics with Python, published by Packt
☆24 • Jul 23, 2023 • Updated 2 years ago