airscholar / realtime-voting-data-engineeringLinks
This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgres and Streamlit. The system is built using Docker Compose to easily spin up the required services in Docker containers.
☆43Updated last year
Alternatives and similar repositories for realtime-voting-data-engineering
Users that are interested in realtime-voting-data-engineering are comparing it to the libraries listed below
Sorting:
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆37Updated last year
- YouTube tutorial project☆105Updated 2 years ago
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆107Updated 8 months ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆43Updated last year
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆168Updated 2 years ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆295Updated 9 months ago
- ☆70Updated last month
- ☆210Updated 2 years ago
- End to end data engineering project☆57Updated 3 years ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆143Updated 2 years ago
- Simple ETL pipeline using Python☆29Updated 2 years ago
- ☆88Updated 3 years ago
- ☆44Updated last year
- DataTalks.Club's Data Engineering Zoomcamp Project☆23Updated 3 years ago
- ☆162Updated 3 years ago
- ☆144Updated 2 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆42Updated 2 years ago
- ☆29Updated 2 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆162Updated 3 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆177Updated 3 months ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆92Updated last year
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆28Updated 2 years ago
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆25Updated 2 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆158Updated 5 years ago
- Data Engineering YouTube Analysis Project by Darshil Parmar☆210Updated 2 years ago
- This project shows how to capture changes from postgres database and stream them into kafka☆38Updated last year
- Series follows learning from Apache Spark (PySpark) with quick tips and workaround for daily problems in hand☆56Updated 2 years ago
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more☆369Updated 2 years ago
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆23Updated last year
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆243Updated 2 years ago