jaumpedro214 / traffic-flow-spark-kafka
Testing Spark Structured Streaming and Kafka with real data from traffic sensors
☆16Updated 2 years ago
Alternatives and similar repositories for traffic-flow-spark-kafka
Users that are interested in traffic-flow-spark-kafka are comparing it to the libraries listed below
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆21Updated 5 years ago
- RedditR for Content Engagement and Recommendation☆21Updated 7 years ago
- ☆40Updated 11 months ago
- Simple ETL pipeline using Python☆26Updated 2 years ago
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces.☆69Updated last year
- Design/Implement stream/batch architecture on NYC taxi data | #DE☆25Updated 4 years ago
- A list of all my posts and personal projects☆73Updated 11 months ago
- ☆40Updated 2 years ago
- Example repo to create end to end tests for data pipeline.☆24Updated 11 months ago
- ☆37Updated 5 years ago
- ☆87Updated 2 years ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆21Updated 2 years ago
- Apache Spark using SQL☆14Updated 3 years ago
- ☆16Updated last year
- Simple stream processing pipeline☆103Updated 11 months ago
- ☆38Updated 2 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆16Updated 5 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆42Updated last year
- End to end data engineering project☆56Updated 2 years ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆39Updated last year
- Data pipeline that scrapes Rust cheater Steam profiles☆51Updated 3 years ago
- Repository for Data Engineering Interview Series☆31Updated 7 months ago
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆22Updated 3 years ago
- In this project, we set up an end-to-end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆31Updated last year
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆55Updated 2 years ago
- Data Streaming Nanodegree (from Udacity) exercises, projects and their solutions☆17Updated last year
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆60Updated last year
- A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D…☆30Updated 4 years ago
- A course by DataTalks Club that covers Spark, Kafka, Docker, Airflow, Terraform, DBT, Big Query etc☆13Updated 3 years ago
- An example of an ETL pipeline that lays out generic DE processes. This is now out of date but still provides useful information☆26Updated 3 years ago