used Airflow, Postgres, Kafka, Spark, and Cassandra, and GitHub Actions to establish an end-to-end data pipeline
☆32Oct 25, 2023Updated 2 years ago
Alternatives and similar repositories for Data-Streaming-Project
Users that are interested in Data-Streaming-Project are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- End-to-end data pipeline that ingests, processes, and stores data. It uses Apache Airflow to schedule scripts that fetch data from an API…☆21Jul 26, 2024Updated last year
- The unique data management platform for Julia☆16Apr 25, 2022Updated 4 years ago
- Code for the Data Engineering Zoomcamp☆20Dec 12, 2022Updated 3 years ago
- The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technolog…☆13Jun 26, 2022Updated 3 years ago
- Flutter file encryption☆13Jun 20, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more☆17May 1, 2023Updated 3 years ago
- Covid19 and Iowa Liquor Sales analysis at BigQuery using dbt, Airflow, Marquez, Google Cloud and other modern data stack tools☆14Jun 18, 2022Updated 3 years ago
- ☆47Jul 6, 2024Updated last year
- ☆19Nov 5, 2024Updated last year
- ☆11Sep 1, 2020Updated 5 years ago
- ☆12Mar 6, 2021Updated 5 years ago
- ☆66Aug 6, 2024Updated last year
- Demo code for face recognition module of opencv in java☆13Dec 27, 2018Updated 7 years ago
- Heard of Machine Learning? What's it all about? 🤷🏾♀️ This repo will contain tutorials on different models required to give you an in…☆15Nov 14, 2017Updated 8 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆52Dec 2, 2023Updated 2 years ago
- Docker Apache Airflow☆13Mar 1, 2023Updated 3 years ago
- E-commerce application with TDD and BLOC☆14Oct 1, 2023Updated 2 years ago
- Single-click deployment, serverless data pipeline that moves Google Analytics raw data to S3 and ETL's it into BigQuery schema☆20Jun 2, 2021Updated 4 years ago
- ☆15Mar 14, 2024Updated 2 years ago
- ☆24Dec 4, 2023Updated 2 years ago
- Scripts and tooling to migrate DW and Spark workloads to Fabric.☆29Apr 9, 2024Updated 2 years ago
- A backtest a day keeps the losses away!☆15Sep 11, 2023Updated 2 years ago
- Código para workshops Spark com ambiente de desenvolvimento em docker☆28Oct 1, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A simple, customizable, and modern library for displaying alert banners in your Jetpack Compose, Compose Multiplatform and native iOS (Sw…☆57Apr 10, 2026Updated last month
- A data pipeline moving data from a Relational database system (RDBMS) to a Hadoop file system (HDFS).☆15Jun 3, 2021Updated 4 years ago
- Shortest path computation using Go and Contraction Hierarchies.☆13Nov 27, 2015Updated 10 years ago
- Glue ETL job or EMR Spark that gets from data catalog, modifies and uploads to S3 and Data Catalog☆13Aug 26, 2023Updated 2 years ago
- This repo is for the Linkedin Learning course: End-to-End Data Engineering Project☆33Nov 9, 2023Updated 2 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆29Aug 18, 2024Updated last year
- A platform that helps developers to better understand CSS through declaration interpretation and may even improve them through suggestion…☆14Jul 3, 2021Updated 4 years ago
- TTS utility☆12Aug 2, 2020Updated 5 years ago
- Spark Notebook docker image☆10Dec 29, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A structured streaming was applied to the robot data from ROS-Gazebo simulation environment using Apache Spark. Data is collected in Kafk…☆19Feb 6, 2022Updated 4 years ago
- velib-v2: An ETL pipeline that employs batch and streaming jobs using Spark, Kafka, Airflow, and other tools, all orchestrated with Docke…☆20Aug 12, 2025Updated 9 months ago
- Data-Science-Projects-in-Python☆11Jul 25, 2018Updated 7 years ago
- A lightweight, open-source UI for dbt that provides model browsing, lineage visualization, run orchestration, documentation previews, and…☆57Mar 18, 2026Updated 2 months ago
- This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de…☆12Nov 18, 2023Updated 2 years ago
- Ecommerce Realtime Data Pipeline (Data Modeling, Workflow Orchestration, Change Data Capture, Analytical Database and Dashboarding)☆67Mar 9, 2024Updated 2 years ago
- 🚀 A simple javascript template for rapid development of GitHub actions.☆17Feb 24, 2023Updated 3 years ago