logicalclocks / maggyLinks
Distribution transparent Machine Learning experiments on Apache Spark
β91Updated last year
Alternatives and similar repositories for maggy
Users that are interested in maggy are comparing it to the libraries listed below
Sorting:
- FlorDB π»β158Updated 3 months ago
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.β41Updated 2 years ago
- Distributed XGBoost on Rayβ152Updated last year
- A Scalable Auto-ML Systemβ55Updated 3 years ago
- Spark implementation of computing Shapley Values using monte-carlo approximationβ80Updated 2 years ago
- MLOps Platformβ272Updated last year
- ForML - A development framework and MLOps platform for the lifecycle management of data science projectsβ107Updated 2 years ago
- NitroML is a modular, portable, and scalable model-quality benchmarking framework for Machine Learning and Automated Machine Learning (Auβ¦β43Updated 4 years ago
- Projects developed by Domino's R&D teamβ77Updated 3 years ago
- Utility Library for Hopsworks. Issues can be posted at https://community.hopsworks.aiβ27Updated last week
- Ray-based Apache Beam runnerβ42Updated 2 years ago
- β59Updated 2 years ago
- XGBoost GPU accelerated on Spark example applicationsβ52Updated 3 years ago
- Willump Is a Low-Latency Useful Machine learning Platform.β45Updated 2 years ago
- Concept drift monitoring for HA model servers.β101Updated 2 years ago
- real-time data + ML pipelineβ53Updated last week
- MLOps Python Libraryβ121Updated 3 years ago
- A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scalβ¦β251Updated 2 weeks ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ toβ¦β29Updated last year
- A deep ranking personalization frameworkβ133Updated last month
- Joblib Apache Spark Backendβ249Updated 10 months ago
- Inspect ML Pipelines in Python in the form of a DAGβ70Updated last year
- AutoBazaar: An AutoML System from the Machine Learning Bazaarβ33Updated 4 years ago
- Avro2TF is designed to fill the gap of making users' training data ready to be consumed by deep learning training frameworks.β128Updated 5 years ago
- Documentation and resources for deploying JupyterHub on Hadoopβ19Updated 6 years ago
- β96Updated 5 years ago
- Python library to run ML/data pipelines on stateless compute infrastructure (that may be ephemeral or serverless). Please see the documenβ¦β18Updated 2 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examplesβ72Updated 5 years ago
- An introductory tutorial about leveraging Ray core features for distributed patterns.β79Updated 2 years ago
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbationβ¦β164Updated last week