rodrigo-arenas / kafkaml-anomaly-detection
Project for real-time anomaly detection using Kafka and python
☆59Updated 2 years ago
Alternatives and similar repositories for kafkaml-anomaly-detection:
Users that are interested in kafkaml-anomaly-detection are comparing it to the libraries listed below
- ☆29Updated last year
- ☆40Updated 7 months ago
- I am using confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆29Updated last year
- Series follows learning from Apache Spark (PySpark) with quick tips and workaround for daily problems in hand☆46Updated last year
- ☆87Updated 2 years ago
- This repository contains the code for a realtime election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆34Updated last year
- A Postgres data warehouse for processing synthetic data using IAC principles☆16Updated last year
- Kafka variant of the MLOps Level 1 stack☆24Updated 2 years ago
- A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D…☆30Updated 4 years ago
- ☆35Updated 2 years ago
- This is my Apache Airflow Local development setup on Windows 10 WSL2/Mac using docker-compose. It will also include some sample DAGs and …☆27Updated last year
- End to end data engineering project with kafka, airflow, spark, postgres and docker.☆78Updated 6 months ago
- This project focuses on building a robust data pipeline using Apache Airflow to automate the ingestion of weather data from the OpenWeath…☆21Updated last year
- Simple stream processing pipeline☆98Updated 7 months ago
- Delta-Lake, ETL, Spark, Airflow☆46Updated 2 years ago
- Project for "Data pipeline design patterns" blog.☆43Updated 6 months ago
- Simple ETL pipeline using Python☆25Updated last year
- The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow.☆52Updated 2 years ago
- Testing Spark Structured Streaming anf Kafka with real data from traffic sensors☆16Updated 2 years ago
- Built a stream processing data pipeline to get data from disparate systems into a dashboard using Kafka as an intermediary.☆29Updated last year
- Writes the CSV file to Postgres, read table and modify it. Write more tables to Postgres with Airflow.☆36Updated last year
- This repository contains an Apache Flink application for real-time sales analytics built using Docker Compose to orchestrate the necessar…☆41Updated last year
- Essential PySpark for Scalable Data Analytics, published by Packt☆43Updated 2 years ago
- A list of all my posts and personal projects☆69Updated 8 months ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python☆84Updated 5 years ago
- Processing TfL data for bike usage with Google Cloud Platform.☆44Updated 2 years ago
- PySpark Tutorial for Beginners on Google Colab: Hands-On Guide☆16Updated 4 years ago
- End to end data engineering project☆53Updated 2 years ago
- Udacity Data Streaming Nanodegree Program☆22Updated 3 years ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆59Updated last year