This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgres and Streamlit. The system is built using Docker Compose to easily spin up the required services in Docker containers.
☆45Dec 11, 2023Updated 2 years ago
Alternatives and similar repositories for realtime-voting-data-engineering
Users that are interested in realtime-voting-data-engineering are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with …☆14Dec 27, 2023Updated 2 years ago
- An end-to-end data engineering pipeline that fetches real-time YouTube analytics and streams them through Kafka for processing with ksqlD…☆16Sep 19, 2023Updated 2 years ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆43Jan 4, 2024Updated 2 years ago
- This project shows how to capture changes from postgres database and stream them into kafka☆42May 17, 2024Updated last year
- This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de…☆11Nov 18, 2023Updated 2 years ago
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆25Jan 26, 2024Updated 2 years ago
- This project provides an end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The spark cluste…☆12Oct 11, 2023Updated 2 years ago
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…☆49Dec 4, 2023Updated 2 years ago
- Apache Airflow advanced functionalities examples☆21Mar 22, 2024Updated 2 years ago
- ☆13Oct 8, 2025Updated 5 months ago
- Fully dockerized Data Warehouse (DWH) using Airflow, dbt, PostgreSQL and dashboard using redash☆25Nov 12, 2022Updated 3 years ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆145Jul 27, 2023Updated 2 years ago
- This project demonstrates how to use Apache Airflow to submit jobs to Apache spark cluster in different programming laguages using Python…☆48Mar 14, 2024Updated 2 years ago
- A simple pipeline utilising cron, Postgres, AWS EC2, and Metabase☆12Jul 9, 2024Updated last year
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆209Oct 23, 2023Updated 2 years ago
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…☆15Apr 29, 2021Updated 4 years ago
- The Modern Data Stack in a (Smaller) Box☆12Jan 28, 2023Updated 3 years ago
- An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow and saves it on Az…☆32Oct 2, 2023Updated 2 years ago
- CI/CD repository template to automate deployments of your production flows☆14Jul 1, 2024Updated last year
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆110Jan 8, 2026Updated 2 months ago
- A streamlit multipage app template for geospatial applications☆66Apr 26, 2023Updated 2 years ago
- Collection of quick starts on docker, terraform, ansible, etc☆18Apr 9, 2024Updated last year
- This MVP data web app uses the Streamlit framework and Facebook's Prophet forecasting package to generate a dynamic forecast from your ow…☆68Jan 12, 2024Updated 2 years ago
- Example of how to build machine learning training workflow on AWS by Prefect☆12Nov 2, 2022Updated 3 years ago
- Welcome to Power BI Embedded Step by Step Series. Using this GitHub Repository you can download complete solution.☆15Jan 2, 2021Updated 5 years ago
- Capstone Project for the IBM Data Engineering Professional Certification.☆13Mar 7, 2022Updated 4 years ago
- A secure Python connector to the Kuda Microfinance Bank OpenAPI (v1.0.2)☆12Aug 13, 2022Updated 3 years ago
- Set up a Cost-Effective Modern Data Stack for a Charity☆19Mar 26, 2025Updated 11 months ago
- A Bootstrapped Template of Django with React using Docker, Postgres Database and Nginx!☆14Dec 14, 2023Updated 2 years ago
- This is an end to end MLOps system☆34Nov 27, 2025Updated 3 months ago
- Repository to host micro service implementation patterns.☆13Jun 25, 2025Updated 9 months ago
- A basic Django (DRF) backend & React frontend boilerplate app with token authentication, login/logout, and password reset functionality. …☆17Sep 12, 2022Updated 3 years ago
- Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog☆13Aug 26, 2023Updated 2 years ago
- GPT-4o Powered Calorie Detecor☆18May 29, 2024Updated last year
- This solution helps you deploy ETL processes and data storage resources to create an Insurance Lake using Amazon S3 buckets for storage, …☆33Mar 12, 2026Updated last week
- ☆10Jan 8, 2024Updated 2 years ago
- An Objective-C library for uploading shots to Dribbble.☆13Mar 27, 2012Updated 13 years ago
- Upload shots to dribbble.com☆14Mar 27, 2012Updated 13 years ago
- A Covid-19 data pipeline on AWS featuring PySpark/Glue, Docker, Great Expectations, Airflow, and Redshift, templated in CloudFormation an…☆23Nov 21, 2023Updated 2 years ago