airscholar / realtime-voting-data-engineeringLinks
This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgres and Streamlit. The system is built using Docker Compose to easily spin up the required services in Docker containers.
☆38Updated last year
Alternatives and similar repositories for realtime-voting-data-engineering
Users that are interested in realtime-voting-data-engineering are comparing it to the libraries listed below
Sorting:
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆30Updated last year
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆95Updated 2 months ago
- YouTube tutorial project☆103Updated last year
- DataTalks.Club's Data Engineering Zoomcamp Project☆23Updated 2 years ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆36Updated last year
- This project shows how to capture changes from postgres database and stream them into kafka☆36Updated last year
- End to end data engineering project☆56Updated 2 years ago
- ☆40Updated 10 months ago
- ☆28Updated last year
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆139Updated last year
- An end-to-end data engineering pipeline that fetches real-time YouTube analytics and streams them through Kafka for processing with ksqlD…☆11Updated last year
- ☆139Updated 2 years ago
- ☆150Updated 3 years ago
- Data Engineering YouTube Analysis Project by Darshil Parmar☆195Updated last year
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…☆44Updated last year
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆251Updated 3 months ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆60Updated last year
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆146Updated 4 years ago
- ☆65Updated last week
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on.☆17Updated 2 years ago
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆19Updated last year
- Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboa…☆231Updated 2 years ago
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆77Updated 11 months ago
- ☆21Updated last year
- In this project, we will build and ETL(Extract,Transform,Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆22Updated 2 years ago
- Projects done in the Data Engineer Nanodegree Program by Udacity.com☆161Updated 2 years ago
- Writes the CSV file to Postgres, read table and modify it. Write more tables to Postgres with Airflow.☆35Updated last year
- ☆87Updated 2 years ago
- ☆34Updated 2 years ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆21Updated 2 years ago