malihasameen / sales-streamingLinks
End to End Sales Streaming Pipeline (FastAPI, Kafka, Spark, Cassandra, MySQL, Superset)
☆10Updated 2 years ago
Alternatives and similar repositories for sales-streaming
Users that are interested in sales-streaming are comparing it to the libraries listed below
Sorting:
- ☆15Updated 5 years ago
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…☆46Updated 2 years ago
- A data pipeline with Kafka, Spark Streaming, dbt, Docker, Airflow, and GCP!☆12Updated 2 years ago
- Spark, Airflow, Kafka☆24Updated 2 years ago
- Dockerizing an Apache Spark Standalone Cluster☆42Updated 3 years ago
- Building Real Time Data Pipeline using Apache Kafka, Apache Spark, Hadoop, PostgreSQL, Django and Flexomonster on Docker to track status …☆22Updated 5 years ago
- This project shows how to capture changes from postgres database and stream them into kafka☆39Updated last year
- This project aims to leverage Amazon Web Services to create trending Youtube videos analytics service. Project contains different data en…☆20Updated 4 months ago
- This is my Apache Airflow Local development setup on Windows 10 WSL2/Mac using docker-compose. It will also include some sample DAGs and …☆34Updated last year
- Project for real-time anomaly detection using Kafka and python☆58Updated 3 years ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆49Updated 2 years ago
- The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Pos…☆74Updated 2 years ago
- This is a demo streaming project simulating a music streaming service.☆34Updated last year
- A simple pipeline utilising cron, Postgres, AWS EC2, and Metabase☆12Updated last year
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆24Updated 3 years ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆44Updated 2 years ago
- ☆32Updated 2 years ago
- Build Data Lake using Open Source tools☆119Updated 7 months ago
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize a…☆35Updated 3 weeks ago
- Question and Answer application using AWS Bedrock, AWS ECS, Langchain, Qdrant, and FastAPI☆15Updated last year
- Data Engineering examples for Airflow, Prefect; dbt for BigQuery, Redshift, ClickHouse, Postgres, DuckDB; PySpark for Batch processing; K…☆68Updated last month
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, duckdb and Superset☆46Updated 3 weeks ago
- ☆45Updated last year
- This Repository is for FastAPI projects☆55Updated 3 weeks ago
- Building a Data Pipeline with an Open Source Stack☆55Updated 6 months ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆45Updated 2 years ago
- The Data Pipeline and Analytics Stack is a comprehensive solution designed for processing, storing, and visualizing data. Explore a compl…☆17Updated 2 years ago
- streamlit dashboard to analyse data☆12Updated 2 years ago
- Example of event-driven architecture with FastAPI Gateway, Kafka, Redis pub/sub and Faust-streaming☆16Updated 4 years ago
- Full stack data engineering tools and infrastructure set-up☆57Updated 4 years ago