PritomDas / Real-Time-Streaming-Data-Pipeline-and-Dashboard
Building a real-time data pipeline using Apache Kafka, Apache Spark, Hadoop, PostgreSQL, Django and Flexmonster on Docker to track the status of servers in data centers across the globe.
☆20Updated 4 years ago
Alternatives and similar repositories for Real-Time-Streaming-Data-Pipeline-and-Dashboard:
Users that are interested in Real-Time-Streaming-Data-Pipeline-and-Dashboard are comparing it to the libraries listed below
- Project for real-time anomaly detection using Kafka and Python☆57Updated 2 years ago
- Spark, Airflow, Kafka☆26Updated last year
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆41Updated last year
- This is my Apache Airflow Local development setup on Windows 10 WSL2/Mac using docker-compose. It will also include some sample DAGs and …☆27Updated 11 months ago
- Challenge Data Engineer☆25Updated 2 years ago
- A data pipeline with Kafka, Spark Streaming, dbt, Docker, Airflow, and GCP!☆11Updated last year
- Demonstration of using Apache Spark to build robust ETL pipelines while taking advantage of open source, general purpose cluster computin…☆24Updated last year
- In this project, we set up an end-to-end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆24Updated last year
- This repo gives an introduction to setting up streaming analytics using open source technologies☆24Updated last year
- Udacity Data Streaming Nanodegree Program☆22Updated 3 years ago
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…☆39Updated last year
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆21Updated 2 years ago
- This repository is about how to deploy a machine learning model and serve it with FastAPI, using MLflow and MinIO☆18Updated last year
- ☆29Updated last year
- Deploying a Machine Learning model streaming application with Apache Kafka☆10Updated 2 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆26Updated 2 years ago
- Delta-Lake, ETL, Spark, Airflow☆45Updated 2 years ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆32Updated last year
- Design/Implement stream/batch architecture on NYC taxi data | #DE☆26Updated 3 years ago
- ☆13Updated 2 years ago
- A Postgres data warehouse for processing synthetic data using IAC principles☆15Updated last year
- Kafka variant of the MLOps Level 1 stack☆22Updated 2 years ago
- Writes a CSV file to Postgres, reads the table, and modifies it. Writes more tables to Postgres with Airflow.☆36Updated last year
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆20Updated 3 years ago
- ☆40Updated 6 months ago
- A modern, enterprise-ready business intelligence web application☆32Updated 2 years ago
- This repository will help you learn Databricks concepts with the help of examples. It will include all the important topics which…☆95Updated 5 months ago
- Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S. nationwide temperature from IoT sensors in real time☆69Updated 8 years ago
- Here I will be exploring various tools and methods that are used in data engineering process with Python.☆22Updated 4 years ago
- A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS).☆15Updated 3 years ago