airscholar / ApacheFlink-SalesAnalytics
This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project demonstrates how to ingest, process, and analyze sales data, showcasing the capabilities of Apache Flink for big data processing.
☆10Updated last year
Alternatives and similar repositories for ApacheFlink-SalesAnalytics
Users that are interested in ApacheFlink-SalesAnalytics are comparing it to the libraries listed below
Sorting:
- This project shows how to capture changes from postgres database and stream them into kafka☆36Updated last year
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…☆44Updated last year
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆29Updated last year
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆37Updated last year
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆35Updated last year
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆19Updated last year
- This project demonstrates how to use Apache Airflow to submit jobs to Apache spark cluster in different programming laguages using Python…☆42Updated last year
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆21Updated last year
- ☆65Updated 2 weeks ago
- A custom end-to-end analytics platform for customer churn☆11Updated this week
- An end-to-end data engineering pipeline that fetches real-time YouTube analytics and streams them through Kafka for processing with ksqlD…☆11Updated last year
- Repository for Data Engineering Interview Series☆31Updated 7 months ago
- End-to-end data pipeline that ingests, processes, and stores data. It uses Apache Airflow to schedule scripts that fetch data from an API…☆19Updated 9 months ago
- ☆28Updated last year
- Simple stream processing pipeline☆102Updated 11 months ago
- ☆87Updated 2 years ago
- GitHub repository related to the course Mastering Elastic Map Reduce for Data Engineers☆24Updated 2 years ago
- Simple ETL pipeline using Python☆26Updated last year
- ☆51Updated last year
- ☆23Updated last month
- ☆32Updated last year
- This project provides an end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The spark cluste…☆11Updated last year
- The Ultimate Hands-On Hadoop - Tame your Big Data!: https://www.udemy.com/the-ultimate-hands-on-hadoop-tame-your-big-data/☆8Updated 6 years ago
- ☆14Updated last year
- ☆21Updated 2 years ago
- Data Engineering with Scala, published by Packt☆24Updated last year
- ☆21Updated last year
- ☆14Updated 2 years ago
- End to end data engineering project☆54Updated 2 years ago
- Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog☆11Updated last year