airscholar / ApacheFlink-SalesAnalytics
This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project demonstrates how to ingest, process, and analyze sales data, showcasing the capabilities of Apache Flink for big data processing.
☆11Updated last year
Alternatives and similar repositories for ApacheFlink-SalesAnalytics:
Users who are interested in ApacheFlink-SalesAnalytics are comparing it to the repositories listed below
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…☆41Updated last year
- This project shows how to capture changes from a Postgres database and stream them into Kafka☆35Updated 9 months ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆34Updated last year
- This project demonstrates how to use Apache Airflow to submit jobs to an Apache Spark cluster in different programming languages using Python…☆39Updated 11 months ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆32Updated last year
- In this project, we set up an end-to-end data engineering pipeline using Apache Spark, Azure Databricks, and Data Build Tool (dbt), using Azure as our …☆26Updated last year
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆18Updated last year
- This repo is for the LinkedIn Learning course: End-to-End Data Engineering Project☆19Updated last year
- Data Engineering Project in GCP☆18Updated last year
- End to end data engineering project☆53Updated 2 years ago
- A custom end-to-end analytics platform for customer churn☆10Updated last month
- Simple stream processing pipeline☆99Updated 8 months ago
- ☆64Updated this week
- ☆11Updated last year
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆21Updated last year
- Repository for Data Engineering Interview Series☆28Updated 4 months ago
- ☆28Updated last year
- ☆30Updated last year
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/☆34Updated 10 months ago
- GitHub repository related to the course Mastering Elastic Map Reduce for Data Engineers☆25Updated 2 years ago
- ☆11Updated last year
- In this project, we will build an ETL (Extract, Transform, Load) pipeline using the Spotify API on AWS. The pipeline will retrieve data fro…☆21Updated last year
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…☆14Updated 3 years ago
- Data Engineering with Google Cloud Platform, published by Packt☆113Updated last year
- ☆21Updated last year
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆21Updated 2 years ago
- Simple ETL pipeline using Python☆25Updated last year
- Nyc_Taxi_Data_Pipeline - DE Project☆96Updated 4 months ago
- data-warehouse-snowflake-for-data-engineering☆15Updated last year
- ☆43Updated 4 years ago