PritomDas / Real-Time-Streaming-Data-Pipeline-and-Dashboard
Building a real-time data pipeline using Apache Kafka, Apache Spark, Hadoop, PostgreSQL, Django and Flexmonster on Docker to track the status of servers in data centers across the globe.
☆20Updated 3 years ago
Related projects
Alternatives and complementary repositories for Real-Time-Streaming-Data-Pipeline-and-Dashboard
- Project for real-time anomaly detection using Kafka and Python☆56Updated last year
- Spark, Airflow, Kafka☆26Updated last year
- ☆37Updated 4 years ago
- Writes a CSV file to Postgres, reads the table, and modifies it. Writes more tables to Postgres with Airflow.☆35Updated last year
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…☆37Updated 11 months ago
- ☆14Updated 4 years ago
- A data pipeline with Kafka, Spark Streaming, dbt, Docker, Airflow, and GCP!☆11Updated last year
- A Postgres data warehouse for processing synthetic data using IAC principles☆16Updated last year
- Building a Data Pipeline with an Open Source Stack☆38Updated 4 months ago
- This repository contains the code for a real-time election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆32Updated 11 months ago
- In this project, we set up end-to-end data engineering using Apache Spark, Azure Databricks, and Data Build Tool (DBT), using Azure as our …☆23Updated 11 months ago
- Design/Implement stream/batch architecture on NYC taxi data | #DE☆26Updated 3 years ago
- An end-to-end real-time stock market data pipeline with Python, AWS EC2, Apache Kafka, and Cassandra. Data is processed on AWS EC2 with Apa…☆22Updated last year
- ☆29Updated 11 months ago
- ☆38Updated 4 months ago
- Delta-Lake, ETL, Spark, Airflow☆44Updated 2 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆16Updated 5 years ago
- ☆32Updated last year
- ☆60Updated last week
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- Near-real-time ETL to populate a dashboard.☆70Updated 5 months ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆40Updated 11 months ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆29Updated 10 months ago
- A pipeline to detect data drift and retrain the model when there is drift☆22Updated last year
- End-to-end data engineering project with Kafka, Airflow, Spark, Postgres and Docker.☆67Updated 3 months ago
- Airflow Tutorials☆24Updated 3 years ago
- I am using a Confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆29Updated last year
- This project shows how to capture changes from a Postgres database and stream them into Kafka☆31Updated 6 months ago
- Learn how to build and deploy an NLP model with FastAPI☆30Updated 3 years ago
- ☆36Updated last year