logicalclocks / maggyLinks
Distribution transparent Machine Learning experiments on Apache Spark
β91Updated last year
Alternatives and similar repositories for maggy
Users that are interested in maggy are comparing it to the libraries listed below
Sorting:
- FlorDB π»β155Updated last month
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.β41Updated 2 years ago
- Distributed XGBoost on Rayβ152Updated last year
- A Scalable Auto-ML Systemβ55Updated 2 years ago
- Spark implementation of computing Shapley Values using monte-carlo approximationβ78Updated 2 years ago
- MLOps Platformβ272Updated last year
- Utility Library for Hopsworks. Issues can be posted at https://community.hopsworks.aiβ27Updated last year
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbationβ¦β164Updated 5 months ago
- Concept drift monitoring for HA model servers.β101Updated 2 years ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projectsβ106Updated 2 years ago
- Avro2TF is designed to fill the gap of making users' training data ready to be consumed by deep learning training frameworks.β128Updated 5 years ago
- NitroML is a modular, portable, and scalable model-quality benchmarking framework for Machine Learning and Automated Machine Learning (Auβ¦β43Updated 4 years ago
- MLOps Python Libraryβ120Updated 3 years ago
- XGBoost GPU accelerated on Spark example applicationsβ52Updated 3 years ago
- β31Updated 4 years ago
- β59Updated last year
- Inspect ML Pipelines in Python in the form of a DAGβ70Updated last year
- Projects developed by Domino's R&D teamβ77Updated 3 years ago
- Documentation and resources for deploying JupyterHub on Hadoopβ19Updated 6 years ago
- A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scalβ¦β249Updated 3 weeks ago
- Coarse-grained lineage and tracing for machine learning pipelines.β469Updated 3 years ago
- Ray-based Apache Beam runnerβ42Updated 2 years ago
- π¦ Deployment tool for online machine learning modelsβ98Updated 3 years ago
- Joblib Apache Spark Backendβ249Updated 8 months ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ toβ¦β29Updated 11 months ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examplesβ71Updated 5 years ago
- Python - Java/Scala API for the Hopsworks feature storeβ54Updated 2 months ago
- A simple, extensible library for developing AutoML systemsβ175Updated 2 years ago
- π Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durationsβ45Updated 2 years ago
- Willump Is a Low-Latency Useful Machine learning Platform.β45Updated 2 years ago