airscholar / ApacheFlink-SalesAnalytics
This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project demonstrates how to ingest, process, and analyze sales data, showcasing the capabilities of Apache Flink for big data processing.
☆11Updated last year
Related projects ⓘ
Alternatives and complementary repositories for ApacheFlink-SalesAnalytics
- This project shows how to capture changes from postgres database and stream them into kafka☆31Updated 6 months ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆31Updated 11 months ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆29Updated 10 months ago
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…☆37Updated 11 months ago
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆16Updated last year
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆15Updated 9 months ago
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆23Updated 11 months ago
- ☆60Updated last week
- Simple stream processing pipeline☆92Updated 5 months ago
- A custom end-to-end data pipeline for customer churn☆9Updated 3 weeks ago
- ☆29Updated 11 months ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆21Updated last year
- data-warehouse-snowflake-for-data-engineering☆14Updated last year
- ☆27Updated last year
- This project provides a comprehensive data pipeline solution to extract, transform, and load (ETL) Reddit data into a Redshift data wareh…☆63Updated last year
- Data Engineering with Scala, published by Packt☆19Updated 9 months ago
- An end-to-end data engineering pipeline that fetches real-time YouTube analytics and streams them through Kafka for processing with ksqlD…☆10Updated last year
- Code for blog at https://www.startdataengineering.com/post/python-for-de/☆55Updated 5 months ago
- Data Engineering on GCP☆30Updated 2 years ago
- Step by step instructions to create a production-ready data pipeline☆27Updated last month
- A demonstration of an ELT (Extract, Load, Transform) pipeline☆27Updated 9 months ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆204Updated last year
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆18Updated 2 months ago
- Code snippets for Data Engineering Design Patterns book☆40Updated last week
- PySpark Tutorial for Beginners - Practical Examples in Jupyter Notebook with Spark version 3.4.1. The tutorial covers various topics like…☆82Updated last year
- A collection of data engineering projects: data modeling, ETL pipelines, data lakes, infrastructure configuration on AWS, data warehousin…☆14Updated 3 years ago
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆22Updated last year
- Build & Learn Data Engineering,Machine Learning over Kubernetes. No Shortcut approach.☆58Updated last year
- End to end data engineering project☆51Updated 2 years ago