logicalclocks / maggy
Distribution transparent Machine Learning experiments on Apache Spark
☆90Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for maggy
- Distributed XGBoost on Ray☆144Updated 4 months ago
- Utility Library for Hopsworks. Issues can be posted at https://community.hopsworks.ai☆27Updated 5 months ago
- Python - Java/Scala API for the Hopsworks feature store☆53Updated this week
- Spark implementation of computing Shapley Values using monte-carlo approximation☆74Updated last year
- 🌻 Flow with FlorDB☆151Updated 2 months ago
- Concept drift monitoring for HA model servers.☆101Updated last year
- XGBoost GPU accelerated on Spark example applications☆52Updated 2 years ago
- ByteHub: making feature stores simple☆58Updated 3 years ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆26Updated this week
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbation…☆165Updated 2 months ago
- Projects developed by Domino's R&D team☆76Updated 2 years ago
- Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops☆117Updated last year
- MLOps Python Library☆116Updated 2 years ago
- Scaling Python Machine Learning☆44Updated last year
- Lambda Learner is a library for iterative incremental training of a class of supervised machine learning models.☆42Updated last year
- A collection of Machine Learning examples to get started with deploying RAPIDS in the Cloud☆138Updated 2 weeks ago
- Ray provider for Apache Airflow☆47Updated 9 months ago
- A JSON-based schema for storing declarative descriptions of machine learning experiments☆45Updated 7 years ago
- A simple, extensible library for developing AutoML systems☆173Updated last year
- Point-in-Time optimizations for Apache Spark☆29Updated 10 months ago
- ForML - A development framework and MLOps platform for the lifecycle management of data science projects☆104Updated last year
- Ray-based Apache Beam runner☆42Updated last year
- An introductory tutorial about leveraging Ray core features for distributed patterns.☆77Updated last year
- ☆30Updated 3 years ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆53Updated 2 months ago
- real-time data + ML pipeline☆54Updated this week
- Joblib Apache Spark Backend☆242Updated 3 months ago
- Distributed skorch on Ray Train☆57Updated 2 years ago
- The Data Linter identifies potential issues (lints) in your ML training data.☆87Updated 6 years ago