Uses Airflow, Postgres, Kafka, Spark, Cassandra, and GitHub Actions to establish an end-to-end data pipeline
☆32Oct 25, 2023Updated 2 years ago
Alternatives and similar repositories for Data-Streaming-Project
Users that are interested in Data-Streaming-Project are comparing it to the libraries listed below.
- Code for the Data Engineering Zoomcamp☆20Dec 12, 2022Updated 3 years ago
- Scan and monitor your network effortlessly! Nmap Prometheus Exporter provides insights into network health and security with Prometheus-c…☆15Oct 2, 2023Updated 2 years ago
- Deploy a complete data stack in just a couple of minutes.☆15Mar 6, 2024Updated 2 years ago
- The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technolog…☆13Jun 26, 2022Updated 3 years ago
- Covid19 and Iowa Liquor Sales analysis at BigQuery using dbt, Airflow, Marquez, Google Cloud and other modern data stack tools☆14Jun 18, 2022Updated 3 years ago
- ☆46Jul 6, 2024Updated last year
- ☆19Nov 5, 2024Updated last year
- ☆12Mar 6, 2021Updated 5 years ago
- The package to wrap Airport server implementation (DuckDB Airport Extension)☆46Apr 6, 2026Updated 3 weeks ago
- ☆15Dec 22, 2016Updated 9 years ago
- ☆19Jun 22, 2022Updated 3 years ago
- A well-documented explanation of data structure types including Linked List, Hash table, Binary Tree, Queues, Stack☆13Jul 30, 2022Updated 3 years ago
- Static site to create tactical animations☆10Sep 2, 2021Updated 4 years ago
- Docker Apache Airflow☆13Mar 1, 2023Updated 3 years ago
- ☆15Mar 14, 2024Updated 2 years ago
- ☆24Dec 4, 2023Updated 2 years ago
- Code for Spark workshops with a development environment in Docker☆28Oct 1, 2021Updated 4 years ago
- A simple, customizable, and modern library for displaying alert banners in your Jetpack Compose, Compose Multiplatform and native iOS (Sw…☆57Apr 10, 2026Updated 3 weeks ago
- Automate Budget Planning with Linear Programming☆15Jan 3, 2026Updated 3 months ago
- A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS).☆15Jun 3, 2021Updated 4 years ago
- Fully dockerized Data Warehouse (DWH) using Airflow, dbt, PostgreSQL and dashboard using redash☆25Nov 12, 2022Updated 3 years ago
- Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog☆13Aug 26, 2023Updated 2 years ago
- A collection of practice projects on Big Data topics, including Hadoop, Spark, Spark Streaming and Kafka☆11Mar 7, 2019Updated 7 years ago
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆33Nov 9, 2023Updated 2 years ago
- TTS utility☆12Aug 2, 2020Updated 5 years ago
- Demonstrates how to schedule GitHub Workflows to run scripts for monitoring product availability on shopee.com☆26Updated this week
- ☆29Apr 24, 2026Updated last week
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆20Aug 12, 2025Updated 8 months ago
- Data-Science-Projects-in-Python☆11Jul 25, 2018Updated 7 years ago
- This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de…☆12Nov 18, 2023Updated 2 years ago
- Ecommerce Realtime Data Pipeline (Data Modeling, Workflow Orchestration, Change Data Capture, Analytical Database and Dashboarding)☆66Mar 9, 2024Updated 2 years ago
- 🚀 A simple javascript template for rapid development of GitHub actions.☆17Feb 24, 2023Updated 3 years ago
- ☆27Aug 28, 2023Updated 2 years ago
- ☆24Jan 2, 2026Updated 4 months ago
- Flask based Movie Recommendation System☆12May 1, 2023Updated 3 years ago
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆23May 14, 2022Updated 3 years ago
- ☆21Oct 1, 2021Updated 4 years ago
- A user-friendly Command & Control (C&C) web platform for remote monitoring, management, and task automation across multiple devices.☆14Dec 15, 2024Updated last year
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆323Feb 14, 2025Updated last year