airscholar / realtime-voting-data-engineering
This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgres and Streamlit. The system is built using Docker Compose to easily spin up the required services in Docker containers.
☆36Updated last year
Alternatives and similar repositories for realtime-voting-data-engineering:
Users that are interested in realtime-voting-data-engineering are comparing it to the libraries listed below
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆27Updated last year
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆34Updated last year
- ☆28Updated last year
- YouTube tutorial project☆101Updated last year
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆18Updated last year
- This project shows how to capture changes from postgres database and stream them into kafka☆36Updated 11 months ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆91Updated last month
- ☆40Updated 9 months ago
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆129Updated last year
- Simple ETL pipeline using Python☆26Updated last year
- DataTalks.Club's Data Engineering Zoomcamp Project☆23Updated 2 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆41Updated last year
- ☆21Updated last year
- An end-to-end data engineering pipeline that fetches real-time YouTube analytics and streams them through Kafka for processing with ksqlD…☆11Updated last year
- End to end data engineering project☆54Updated 2 years ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆60Updated last year
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…☆44Updated last year
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆21Updated last year
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆21Updated last year
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…☆15Updated 3 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆143Updated 4 years ago
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆45Updated 5 years ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆244Updated 2 months ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆25Updated 2 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆160Updated 2 years ago
- ☆87Updated 2 years ago
- This project provides an end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The spark cluste…☆11Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆74Updated 10 months ago
- Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog☆11Updated last year
- Near real time ETL to populate a dashboard.☆73Updated 10 months ago