airscholar / ApacheFlink-SalesAnalyticsLinks
This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project demonstrates how to ingest, process, and analyze sales data, showcasing the capabilities of Apache Flink for big data processing.
☆11Updated 2 years ago
Alternatives and similar repositories for ApacheFlink-SalesAnalytics
Users that are interested in ApacheFlink-SalesAnalytics are comparing it to the libraries listed below
Sorting:
- This project shows how to capture changes from postgres database and stream them into kafka☆40Updated last year
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with …☆15Updated 2 years ago
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…☆47Updated 2 years ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆45Updated 2 years ago
- Apache Airflow advanced functionalities examples☆21Updated last year
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆44Updated 2 years ago
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆38Updated 2 years ago
- A custom end-to-end analytics platform for customer churn☆11Updated 8 months ago
- ☆19Updated 2 years ago
- Git Repo for EDW Best Practice Assets on the Lakehouse☆16Updated 2 years ago
- This project demonstrates how to use Apache Airflow to submit jobs to Apache spark cluster in different programming laguages using Python…☆48Updated last year
- ☆70Updated last week
- Simple stream processing pipeline☆110Updated last year
- End to end data engineering project☆58Updated 3 years ago
- A demonstration of an ELT (Extract, Load, Transform) pipeline☆31Updated last year
- A complete data engineering project demonstrating modern data stack practices with Apache Flink, Iceberg, Trino and Superset☆20Updated 4 months ago
- ☆28Updated 10 months ago
- End-to-end data pipeline that ingests, processes, and stores data. It uses Apache Airflow to schedule scripts that fetch data from an API…☆20Updated last year
- GitHub repository related to the course Mastering Elastic Map Reduce for Data Engineers☆24Updated 3 years ago
- ☆15Updated 2 years ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆312Updated 11 months ago
- Analytics engineering with dbt - projects and developer environment☆22Updated last year
- Rust And Delta Demo. Explanation and walkthrough on delta-rs☆10Updated 2 years ago
- Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize a…☆36Updated last month
- Bigdata on Kubernetes, Published by Packt☆36Updated last year
- Duke MIDS: Data Engineering and DataOps Course☆69Updated last year
- Apache Airflow Best Practices, published by Packt☆50Updated last year
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆24Updated 2 years ago
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆28Updated 2 years ago
- Repo which holds the materials for the EMR Zero To Hero☆27Updated 3 years ago