airscholar / realtime-voting-data-engineeringLinks
This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgres and Streamlit. The system is built using Docker Compose to easily spin up the required services in Docker containers.
☆41Updated last year
Alternatives and similar repositories for realtime-voting-data-engineering
Users that are interested in realtime-voting-data-engineering are comparing it to the libraries listed below
Sorting:
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆272Updated 6 months ago
- YouTube tutorial project☆105Updated last year
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆102Updated 5 months ago
- DataTalks.Club's Data Engineering Zoomcamp Project☆23Updated 3 years ago
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆33Updated last year
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆142Updated 2 years ago
- ☆142Updated 2 years ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆38Updated last year
- This project shows how to capture changes from postgres database and stream them into kafka☆38Updated last year
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆151Updated last year
- ☆28Updated last year
- ☆44Updated last year
- ☆204Updated 2 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆162Updated 2 years ago
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆24Updated last year
- End to end data engineering project☆57Updated 2 years ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆28Updated 2 years ago
- Simple ETL pipeline using Python☆27Updated 2 years ago
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…☆15Updated 4 years ago
- Price Crawler - Tracking Price Inflation☆186Updated 5 years ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆22Updated 2 years ago
- ☆153Updated 3 years ago
- A template repository to create a data project with IAC, CI/CD, Data migrations, & testing☆274Updated last year
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆152Updated 5 years ago
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆23Updated 2 years ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews☆160Updated last year
- Data Engineering YouTube Analysis Project by Darshil Parmar☆203Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆85Updated last year
- ☆287Updated last year
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆42Updated last year