malihasameen / sales-streamingLinks
End to End Sales Streaming Pipeline (FastAPI, Kafka, Spark, Cassandra, MySQL, Superset)
☆10Updated 2 years ago
Alternatives and similar repositories for sales-streaming
Users that are interested in sales-streaming are comparing it to the libraries listed below
Sorting:
- A data pipeline with Kafka, Spark Streaming, dbt, Docker, Airflow, and GCP!☆12Updated 2 years ago
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…☆44Updated last year
- Building Real Time Data Pipeline using Apache Kafka, Apache Spark, Hadoop, PostgreSQL, Django and Flexomonster on Docker to track status …☆20Updated 4 years ago
- Spark, Airflow, Kafka☆26Updated 2 years ago
- Example of event-driven architecture with FastAPI Gateway, Kafka, Redis pub/sub and Faust-streaming☆16Updated 3 years ago
- ☆15Updated 5 years ago
- This Repository is for FastAPI projects☆39Updated 7 months ago
- Nyc_Taxi_Data_Pipeline - DE Project☆116Updated 9 months ago
- Project for real-time anomaly detection using Kafka and python☆58Updated 2 years ago
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize a…☆32Updated last year
- Question and Answer application using AWS Bedrock, AWS ECS, Langchain, Qdrant, and FastAPI☆15Updated last year
- This project shows how to capture changes from postgres database and stream them into kafka☆37Updated last year
- End-to-end Kafka Streaming Examples on Databricks with Evolving Avro Schemas.☆9Updated last year
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆47Updated last year
- The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Pos…☆65Updated 2 years ago
- ☆17Updated 3 months ago
- A simple pipeline utilising cron, Postgres, AWS EC2, and Metabase☆12Updated last year
- ☆33Updated last year
- DataTalks.Club's Data Engineering Zoomcamp Project☆23Updated 3 years ago
- An end-to-end MLOps pipeline(CI/CD/CT/CM) project for training, versioning, deploying, and monitoring machine learning models using FastA…☆18Updated last year
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆24Updated 3 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆42Updated last year
- A Postgres data warehouse for processing synthetic data using IAC principles☆18Updated 2 years ago
- Building a Data Pipeline with an Open Source Stack☆55Updated last month
- Build Data Lake using Open Source tools☆108Updated 2 months ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆38Updated last year
- Dockerizing an Apache Spark Standalone Cluster☆43Updated 3 years ago
- Spark data pipeline that processes movie ratings data.☆29Updated last week
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆41Updated last year
- A curated list of awesome open source tools and commercial products that will help you train, deploy, monitor, version, scale, and secure…☆17Updated 3 years ago