tatoliop / PROUD-PaRallel-OUtlier-Detection-for-streamsLinks
PROUD is an open-source high-throughput distributed outlier detection engine for intense data streams that is implemented in Scala on top of the Apache Flink framework.
☆12Updated 3 years ago
Alternatives and similar repositories for PROUD-PaRallel-OUtlier-Detection-for-streams
Users that are interested in PROUD-PaRallel-OUtlier-Detection-for-streams are comparing it to the libraries listed below
Sorting:
- real-time data + ML pipeline☆54Updated this week
- Documentation for Hopsworks and Hops☆11Updated 3 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆47Updated last year
- Making Machine Learning Simple and Scalable with Python, Jupyter Notebook, TensorFlow, Keras, Apache Kafka and KSQL☆96Updated 6 years ago
- Code to solve a open dataset of predictive maintanance of sheet brek on a paper mill.☆8Updated 4 years ago
- Best practices for engineering ML pipelines.☆35Updated 3 years ago
- Data validation library for PySpark 3.0.0☆33Updated 2 years ago
- Common API for all "second gen" AutoML APIs: Auger.AI, Google Cloud AutoML and Azure AutoML☆41Updated 7 months ago
- My MSc on Data Science final project. This is a library for Data Pre-processing Algorithms for Streaming in Flink (DPASF)☆18Updated 6 years ago
- One click deploy docker-compose with Kafka, Spark Streaming, Zeppelin UI and Monitoring (Grafana + Kafka Manager)☆121Updated 4 years ago
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- ☆12Updated 3 years ago
- 🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations☆45Updated 2 years ago
- Timeseries Anomaly detection and Root Cause Analysis on data in SQL data warehouses and databases☆230Updated 3 years ago
- Scaling Python Machine Learning☆47Updated last year
- ForestFlow is a policy-driven Machine Learning Model Server. It is an LF AI Foundation incubation project.☆73Updated last year
- Parallel approach on distance based outlier detection on streaming data☆10Updated 5 years ago
- Utility Library for Hopsworks. Issues can be posted at https://community.hopsworks.ai☆27Updated last year
- Anomaly detection for streaming time series, featuring automated model selection.☆209Updated last year
- Python - Java/Scala API for the Hopsworks feature store☆54Updated last week
- AutoBazaar: An AutoML System from the Machine Learning Bazaar☆33Updated 4 years ago
- Using Kafka-Python to illustrate a ML production pipeline☆112Updated 2 years ago
- Awesome list of AutoML frameworks - curated by @oskar-j☆29Updated 2 years ago
- Projects developed by Domino's R&D team☆78Updated 3 years ago
- Online Time Series Anomaly Detectors☆29Updated 2 years ago
- RedisAI integration for MLFlow☆30Updated 2 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Updated 4 years ago
- A bridge to Apache Atlas for provenance metadata created in course of using Apache NiFi☆15Updated 2 years ago
- Simple project using pyflink, kafka and postgre containerized using Docker☆11Updated 11 months ago
- ☆30Updated 3 years ago