logicalclocks / maggy
Distribution transparent Machine Learning experiments on Apache Spark
★91 Updated last year
Alternatives and similar repositories for maggy
Users that are interested in maggy are comparing it to the libraries listed below
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models. ★42 Updated 2 years ago
- Flow with FlorDB 💻 ★154 Updated 2 weeks ago
- A Scalable Auto-ML System ★53 Updated 2 years ago
- XGBoost GPU accelerated on Spark example applications ★53 Updated 3 years ago
- Spark implementation of computing Shapley Values using Monte Carlo approximation ★76 Updated 2 years ago
- Distributed XGBoost on Ray ★149 Updated last year
- MLOps Platform ★272 Updated 10 months ago
- ★58 Updated last year
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects ★106 Updated 2 years ago
- Utility Library for Hopsworks. Issues can be posted at https://community.hopsworks.ai ★27 Updated last year
- Avro2TF is designed to fill the gap of making users' training data ready to be consumed by deep learning training frameworks. ★128 Updated 5 years ago
- Projects developed by Domino's R&D team ★77 Updated 3 years ago
- Ray-based Apache Beam runner ★41 Updated 2 years ago
- ★30 Updated 4 years ago
- Real-time data + ML pipeline ★54 Updated last week
- Willump Is a Low-Latency Useful Machine learning Platform. ★44 Updated 2 years ago
- Joblib Apache Spark Backend ★249 Updated 5 months ago
- Concept drift monitoring for HA model servers. ★102 Updated 2 years ago
- AutoBazaar: An AutoML System from the Machine Learning Bazaar ★33 Updated 4 years ago
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbation… ★165 Updated 2 months ago
- NitroML is a modular, portable, and scalable model-quality benchmarking framework for Machine Learning and Automated Machine Learning (Au… ★43 Updated 4 years ago
- A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scal… ★248 Updated 2 weeks ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to… ★29 Updated 9 months ago
- ★40 Updated 9 years ago
- Documentation and resources for deploying JupyterHub on Hadoop ★19 Updated 6 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples ★72 Updated 5 years ago
- RL-Bakery makes it easy to build production, large scale, batch Deep Reinforcement Learning applications. ★92 Updated 11 months ago
- Inspect ML Pipelines in Python in the form of a DAG ★70 Updated last year
- The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning wo… ★171 Updated 2 years ago
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients ★37 Updated 4 years ago