☆46Jul 6, 2024Updated last year
Alternatives and similar repositories for Data-Engineering-Streaming-Project
Users that are interested in Data-Engineering-Streaming-Project are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10May 5, 2022Updated 3 years ago
- Courses and projects on Data Camp☆11Jun 28, 2020Updated 5 years ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆146Jul 27, 2023Updated 2 years ago
- Quickstart to Cilium☆17Oct 1, 2025Updated 6 months ago
- ☆30Nov 16, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for the Data Engineering Zoomcamp☆20Dec 12, 2022Updated 3 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆43Sep 26, 2023Updated 2 years ago
- ☆13May 11, 2025Updated 11 months ago
- used Airflow, Postgres, Kafka, Spark, and Cassandra, and GitHub Actions to establish an end-to-end data pipeline☆32Oct 25, 2023Updated 2 years ago
- Contains spark dataframe solutions of leetcode questions☆24Dec 13, 2022Updated 3 years ago
- Branch Metrics Win32/C++ SDK☆10Jun 10, 2025Updated 10 months ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Dec 28, 2022Updated 3 years ago
- Modeling customer churn with Spark☆12Jan 24, 2019Updated 7 years ago
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆25Jan 26, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆111Jan 8, 2026Updated 3 months ago
- Skunks Skool Tutorials☆17Dec 19, 2022Updated 3 years ago
- Repository for the Demo of using DVC with PyCaret & MLOps (DVC Office Hours - 20th Jan, 2022)☆11Jan 20, 2022Updated 4 years ago
- This repository about how to deploy machine learning model end serving with FastAPI and using MLFlow-MINIO☆18Jun 11, 2023Updated 2 years ago
- ☆13Oct 28, 2025Updated 6 months ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆65Jul 21, 2023Updated 2 years ago
- ☆16Feb 17, 2020Updated 6 years ago
- ☆25Mar 18, 2025Updated last year
- Monotonic Optimal Binning algorithm is a statistical approach to transform continuous variables into optimal and monotonic categorical va…☆19Mar 27, 2026Updated last month
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- End-to-End deployment of E-commerce customers segmentation using Clustering Machine learning algorithms in Google Cloud Platform and MLOp…☆19Jun 5, 2024Updated last year
- Apache Flink/Apache Kafka streaming data analytics demonstration using Streaming Synthetic Sales Data Generator☆15Jun 4, 2024Updated last year
- GitHub Action That Submits Argo Workflows For Execution on Your GKE Cluster☆16Jan 25, 2021Updated 5 years ago
- A data engineering project with Airflow, dbt, Terrafrom, GCP and much more!☆26Nov 8, 2022Updated 3 years ago
- Dockerfile for OpenLogReplicator☆21Mar 3, 2026Updated last month
- ☆12Mar 6, 2021Updated 5 years ago
- A sphinx extension for adding pyscript to a page☆15Updated this week
- The goal of this project is to analyse the impact of Covid-19 on the Aviation industry through data engineering processes using technolog…☆13Jun 26, 2022Updated 3 years ago
- Example of how to build machine learning training workflow on AWS by Prefect☆12Nov 2, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A repo to track data engineering projects☆13Nov 11, 2022Updated 3 years ago
- ☆16May 29, 2023Updated 2 years ago
- ☆17Feb 11, 2025Updated last year
- Data engineering project using UK Bus Open Data Service (BODS) to calculate late buses in real-time for any selected region in England. P…☆31Apr 2, 2023Updated 3 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, duckdb and Superset☆48Apr 5, 2026Updated 3 weeks ago
- Deploy of Airflow 2.0 using ECS Fargate and AWS CDK.☆14Nov 5, 2021Updated 4 years ago
- Implement different variants of gradient descent in python using numpy☆11Apr 23, 2019Updated 7 years ago