airscholar / ApacheFlink-SalesAnalytics
This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project demonstrates how to ingest, process, and analyze sales data, showcasing the capabilities of Apache Flink for big data processing.
☆11 Updated last year
Alternatives and similar repositories for ApacheFlink-SalesAnalytics:
Users who are interested in ApacheFlink-SalesAnalytics are comparing it to the repositories listed below.
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar… ☆39 Updated last year
- This project shows how to capture changes from a Postgres database and stream them into Kafka ☆31 Updated 8 months ago
- This project demonstrates how to use Apache Airflow to submit jobs to an Apache Spark cluster in different programming languages using Python… ☆35 Updated 10 months ago
- This repository contains the code for a real-time election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr… ☆34 Updated last year
- In this project, we set up end-to-end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our … ☆25 Updated last year
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA… ☆32 Updated last year
- This repo is for the LinkedIn Learning course: End-to-End Data Engineering Project ☆19 Updated last year
- ☆29 Updated last year
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en… ☆16 Updated last year
- An end-to-end data engineering pipeline that fetches real-time YouTube analytics and streams them through Kafka for processing with ksqlD… ☆11 Updated last year
- ☆11 Updated last year
- A custom end-to-end analytics platform for customer churn ☆10 Updated last week
- Simple stream processing pipeline ☆96 Updated 7 months ago
- ☆61 Updated 3 weeks ago
- End to end data engineering project ☆53 Updated 2 years ago
- ☆28 Updated last year
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka… ☆224 Updated last year
- I am using a Confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti… ☆29 Updated last year
- Data Engineering Project in GCP ☆18 Updated last year
- Repo which holds the materials for the EMR Zero To Hero ☆27 Updated 2 years ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/ ☆32 Updated 9 months ago
- Simple ETL pipeline using Python ☆25 Updated last year
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews ☆102 Updated 8 months ago
- Repository for Data Engineering Interview Series ☆28 Updated 3 months ago
- Project for "Data pipeline design patterns" blog. ☆43 Updated 5 months ago
- ☆40 Updated 6 months ago
- Code to demonstrate data engineering metadata & logging best practices ☆15 Updated 10 months ago
- Companion repository for the "Streamlining AWS Glue CI/CD — A Comprehensive Blueprint" blog post ☆11 Updated 2 months ago
- ☆21 Updated last year
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB. ☆39 Updated last year