Distributed Model Serving Framework
☆187Sep 30, 2025Updated 5 months ago
Alternatives and similar repositories for modelmesh
Users that are interested in modelmesh are comparing it to the libraries listed below
Sorting:
- Controller for ModelMesh☆243Updated this week
- Unified runtime-adapter image of the sidecar containers which run in the modelmesh pods☆22Updated this week
- Backend server for envd☆22Dec 18, 2023Updated 2 years ago
- Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes☆5,135Updated this week
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆11Apr 1, 2020Updated 5 years ago
- KServe models web UI☆47Feb 18, 2026Updated last week
- User documentation for KServe.☆109Updated this week
- Prototypes and experiments for WG Device Management.☆15Feb 11, 2026Updated 2 weeks ago
- Following the same workflows as Kubernetes. Widely used in InftyAI community.☆13Dec 5, 2025Updated 2 months ago
- An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more☆875Updated this week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆314Updated this week
- Node.js binding for PyTorch.☆18Jun 27, 2021Updated 4 years ago
- This repository contains statistics about the AI Infrastructure products.☆17Feb 27, 2025Updated last year
- Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively.☆203Mar 24, 2022Updated 3 years ago
- A library developed by Volcano Engine for high-performance reading and writing of PyTorch model files.☆25Jan 2, 2025Updated last year
- 🧘 Extensive LLM endpoints, expended capabilities through your favorite protocols, 🕸️ GraphQL, ↔️ gRPC, ♾️ WebSocket. Extended SOTA supp…☆18Updated this week
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- GPU scheduler for elastic/distributed deep learning workloads in Kubernetes cluster (IC2E'23)☆34Nov 11, 2023Updated 2 years ago
- Docker for Your ML/DL Models Based on OCI Artifacts☆474Jan 26, 2024Updated 2 years ago
- Cloud Native Machine Learning Model Registry☆82Jan 12, 2023Updated 3 years ago
- Example DRA driver that developers can fork and modify to get them started writing their own.☆120Updated this week
- Cloud Native ML/DL Platform☆133Sep 9, 2020Updated 5 years ago
- ☆15Aug 7, 2025Updated 6 months ago
- ☆34Jan 30, 2026Updated last month
- Idle containers when not handling requests.☆41Mar 31, 2023Updated 2 years ago
- Model Express is a Rust-based component meant to be placed next to existing model inference systems to speed up their startup times and i…☆31Updated this week
- Provides deploy scripts and CSI for Lustre.☆14Oct 27, 2025Updated 4 months ago
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆12May 16, 2023Updated 2 years ago
- It is very easy to switch from Docker Shim to CRI Dockerd and back☆31Oct 30, 2023Updated 2 years ago
- A toolkit to run Ray applications on Kubernetes☆2,341Feb 23, 2026Updated last week
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I…☆165Updated this week
- Elastic Deep Learning Training based on Kubernetes by Leveraging EDL and Volcano☆32May 19, 2023Updated 2 years ago
- Manage kubernetes node-level kernel tuning ( using sysctl ).☆30Nov 21, 2025Updated 3 months ago
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆10,393Updated this week
- This is a collection of the EMC storage platform drivers for ClusterHQ's Flocker☆12Oct 19, 2016Updated 9 years ago
- CPU DRA Driver☆32Feb 9, 2026Updated 2 weeks ago
- d.run website☆15Feb 9, 2026Updated 3 weeks ago
- wsc is a library that allows to interact with web sockets using channels.☆13Dec 4, 2025Updated 2 months ago
- ☆13Oct 7, 2025Updated 4 months ago