airscholar / realtime-voting-data-engineering
This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgres and Streamlit. The system is built using Docker Compose to easily spin up the required services in Docker containers.
☆34Updated last year
Alternatives and similar repositories for realtime-voting-data-engineering:
Users that are interested in realtime-voting-data-engineering are comparing it to the libraries listed below
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆32Updated last year
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆26Updated last year
- DataTalks.Club's Data Engineering Zoomcamp Project☆22Updated 2 years ago
- ☆28Updated last year
- YouTube tutorial project☆98Updated last year
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆107Updated 2 years ago
- ☆41Updated 7 months ago
- ☆63Updated this week
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆79Updated 6 months ago
- This project shows how to capture changes from postgres database and stream them into kafka☆35Updated 9 months ago
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆18Updated last year
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆24Updated last year
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on.☆17Updated last year
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆117Updated last year
- Simple ETL pipeline using Python☆25Updated last year
- Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog☆11Updated last year
- Data Engineering Project with Hadoop HDFS and Kafka☆46Updated last year
- ☆149Updated 2 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆39Updated last year
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆21Updated last year
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆21Updated last year
- An end-to-end data engineering pipeline that fetches real-time YouTube analytics and streams them through Kafka for processing with ksqlD…☆11Updated last year
- ☆87Updated 2 years ago
- Data Engineering YouTube Analysis Project by Darshil Parmar☆179Updated last year
- End to end data engineering project☆53Updated 2 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆137Updated 4 years ago
- Classwork projects and home works done through Udacity data engineering nano degree☆74Updated last year
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆230Updated this week
- With everything I learned from DEZoomcamp from datatalks.club, this project performs a batch processing on AWS for the cycling dataset wh…☆12Updated 2 years ago
- ☆135Updated 2 years ago