malihasameen / sales-streaming
End to End Sales Streaming Pipeline (FastAPI, Kafka, Spark, Cassandra, MySQL, Superset)
☆10Updated 2 years ago
Alternatives and similar repositories for sales-streaming
Users who are interested in sales-streaming are comparing it to the repositories listed below
- A data pipeline with Kafka, Spark Streaming, dbt, Docker, Airflow, and GCP!☆12Updated 2 years ago
- ☆15Updated 5 years ago
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…☆44Updated last year
- This project aims to leverage Amazon Web Services to create a trending YouTube videos analytics service. Project contains different data en…☆18Updated 2 weeks ago
- This project shows how to capture changes from a Postgres database and stream them into Kafka☆38Updated last year
- Building a real-time data pipeline using Apache Kafka, Apache Spark, Hadoop, PostgreSQL, Django and Flexmonster on Docker to track status …☆20Updated 4 years ago
- Question and Answer application using AWS Bedrock, AWS ECS, Langchain, Qdrant, and FastAPI☆15Updated last year
- Build Data Lake using Open Source tools☆109Updated 3 months ago
- Building a Data Pipeline with an Open Source Stack☆55Updated 2 months ago
- Project for real-time anomaly detection using Kafka and python☆58Updated 2 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, PostgreSQL and Superset☆45Updated 9 months ago
- This Repository is for FastAPI projects☆40Updated 7 months ago
- This project demonstrates how to build and automate an ETL pipeline using DAGs in Airflow and load the transformed data to BigQuery. Ther…☆24Updated last week
- The goal of this project is to build a Docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Pos…☆66Updated 2 years ago
- Dockerizing an Apache Spark Standalone Cluster☆43Updated 3 years ago
- A simple pipeline utilising cron, Postgres, AWS EC2, and Metabase☆12Updated last year
- Building a Data Lakehouse with open source technology. Supports an end-to-end data pipeline, from source data on AWS S3 to Lakehouse, visualize a…☆32Updated last year
- The Data Pipeline and Analytics Stack is a comprehensive solution designed for processing, storing, and visualizing data. Explore a compl…☆17Updated last year
- Creating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, dbt, Mage AI, and Dash.☆14Updated 2 years ago
- Data Engineering examples for Airflow, Prefect; dbt for BigQuery, Redshift, ClickHouse, Postgres, DuckDB; PySpark for Batch processing; K…☆66Updated 2 months ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆39Updated last year
- New-generation open-source data stack☆72Updated 3 years ago
- Streamlit dashboard to analyse data☆12Updated 2 years ago
- ☆33Updated last year
- ☆44Updated last year
- ☆19Updated 4 months ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆47Updated last year
- Create streaming data, transfer it to Kafka, modify it with PySpark, and send it to Elasticsearch and MinIO☆63Updated 2 years ago
- Bigdata on Kubernetes, Published by Packt☆35Updated 11 months ago
- Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team …☆124Updated last week