jaumpedro214 / traffic-flow-spark-kafka
Testing Spark Structured Streaming and Kafka with real data from traffic sensors
☆15 Updated last year
Related projects:
- Project for "Data pipeline design patterns" blog. ☆41 Updated last month
- End to end data engineering project ☆49 Updated last year
- Simple stream processing pipeline ☆89 Updated 3 months ago
- ☆35 Updated 2 months ago
- Example repo to create end to end tests for data pipeline. ☆21 Updated 3 months ago
- Data pipeline that scrapes Rust cheater Steam profiles ☆50 Updated 2 years ago
- Processing TfL data for bike usage with Google Cloud Platform. ☆39 Updated 2 years ago
- This is a demo streaming project simulating a music streaming service. ☆23 Updated last month
- Kafka variant of the MLOps Level 1 stack ☆22 Updated 2 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB. ☆36 Updated 11 months ago
- ☆35 Updated last year
- Delta-Lake, ETL, Spark, Airflow ☆42 Updated last year
- Project for real-time anomaly detection using Kafka and python ☆55 Updated last year
- Course Material Data Engineering on AWS Course ☆26 Updated last week
- Code for my "Efficient Data Processing in SQL" book. ☆47 Updated last month
- build dw with dbt ☆26 Updated last month
- End-to-end Kafka Streaming Examples on Databricks with Evolving Avro Schemas. ☆9 Updated 6 months ago
- Create streaming data, transfer it to Kafka, modify it with PySpark, and load it into ElasticSearch and MinIO ☆56 Updated last year
- ☆27 Updated 10 months ago
- ☆84 Updated 2 years ago
- ☆30 Updated last year
- Final Project of the MLOps Zoomcamp hosted by DataTalksClub. ☆26 Updated last year
- A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D… ☆29 Updated 4 years ago
- Near real time ETL to populate a dashboard. ☆69 Updated 3 months ago
- Simple ETL pipeline using Python ☆20 Updated last year
- Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P… ☆26 Updated last year
- Nyc_Taxi_Data_Pipeline - DE Project ☆62 Updated last month
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc… ☆20 Updated 2 years ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath… ☆20 Updated last year
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d… ☆24 Updated 2 years ago