Automatic tuning for ML model deployment on Kubernetes
☆80Nov 1, 2024Updated last year
Alternatives and similar repositories for morphling
Users that are interested in morphling are comparing it to the libraries listed below
Sorting:
- Run your deep learning workloads on Kubernetes more easily and efficiently.☆531Mar 4, 2024Updated 2 years ago
- GPU topology-aware scheduler☆13Jul 7, 2017Updated 8 years ago
- RESPECT: Reinforcement Learning based Edge Scheduling on Pipelined Coral Edge TPUs (DAC'23)☆11Apr 13, 2023Updated 2 years ago
- Kubernetes Operator for AI and Bigdata Elastic Training☆91Jan 10, 2025Updated last year
- Common APIs and libraries shared by other Kubeflow operator repositories.☆53May 28, 2023Updated 2 years ago
- GPU-scheduler-for-deep-learning☆210Nov 5, 2020Updated 5 years ago
- Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving☆37Dec 27, 2019Updated 6 years ago
- Kubernetes Scheduler for Deep Learning☆264May 22, 2022Updated 3 years ago
- ☆539Jun 7, 2024Updated last year
- A SapientML plugin of SapientMLGenerator☆11Dec 23, 2025Updated 2 months ago
- Kubernetes Scheduler Simulator☆125Jul 31, 2024Updated last year
- An example of kubernetes scheduler extender☆15Apr 12, 2019Updated 6 years ago
- A Kubernetes operator for mxnet jobs☆52Dec 1, 2021Updated 4 years ago
- Code for "Heterogenity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020☆137Jul 25, 2024Updated last year
- GPU Sharing Device Plugin for Kubernetes Cluster☆492Jan 10, 2023Updated 3 years ago
- Helios Traces from SenseTime☆61Sep 27, 2022Updated 3 years ago
- AI-ML-NLP Task Group☆13Aug 10, 2023Updated 2 years ago
- GPU Sharing Scheduler for Kubernetes Cluster☆1,528Dec 29, 2023Updated 2 years ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆11Apr 1, 2020Updated 5 years ago
- Simulated large clusters for Kubernetes scheduler validation.☆15Jan 3, 2023Updated 3 years ago
- iGniter, an interference-aware GPU resource provisioning framework for achieving predictable performance of DNN inference in the cloud.☆39Jun 11, 2024Updated last year
- ☆892Apr 2, 2024Updated last year
- An Efficient Dynamic Resource Scheduler for Deep Learning Clusters☆41Oct 28, 2017Updated 8 years ago
- Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)☆516Feb 23, 2026Updated last week
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- ☆12Jun 14, 2020Updated 5 years ago
- Repository for AI model benchmarking on TT-Buda☆15Feb 9, 2026Updated 3 weeks ago
- edge/mobile transformer based Vision DNN inference benchmark☆16Aug 29, 2025Updated 6 months ago
- Examples of inference pipelines implemented using https://github.com/SeldonIO/seldon-core☆14Feb 1, 2023Updated 3 years ago
- Source code of IPA, https://escholarship.org/uc/item/2p0805dq☆12Jun 27, 2024Updated last year
- Resource-adaptive cluster scheduler for deep learning training.☆454Mar 5, 2023Updated 2 years ago
- Model-less Inference Serving☆94Nov 4, 2023Updated 2 years ago
- ☆198Aug 31, 2019Updated 6 years ago
- ModelMesh Performance Scripts, Dashboard and Pipelines☆12May 12, 2025Updated 9 months ago
- ☆19Jun 4, 2024Updated last year
- Docker for Your ML/DL Models Based on OCI Artifacts☆474Jan 26, 2024Updated 2 years ago
- ☆15Jul 3, 2025Updated 8 months ago
- Model factory is a ML training platform to help engineers to build ML models at scale☆17Sep 27, 2021Updated 4 years ago
- BATCH: Adaptive Batching for Efficient MachineLearning Serving on Serverless Platforms☆11Aug 7, 2021Updated 4 years ago