Distributed Model Serving Framework
☆188Apr 14, 2026Updated last month
Alternatives and similar repositories for modelmesh
Users that are interested in modelmesh are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Controller for ModelMesh☆242Apr 14, 2026Updated last month
- Unified runtime-adapter image of the sidecar containers which run in the modelmesh pods☆24Apr 14, 2026Updated last month
- Backend server for envd☆21Dec 18, 2023Updated 2 years ago
- Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes☆5,561Updated this week
- KServe models web UI☆49May 25, 2026Updated 2 weeks ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- User documentation for KServe.☆112Updated this week
- Following the same workflows as Kubernetes. Widely used in InftyAI community.☆13May 31, 2026Updated last week
- ModelMesh Performance Scripts, Dashboard and Pipelines☆13Apr 14, 2026Updated last month
- KServe community docs for contributions and process☆15May 7, 2026Updated last month
- A library developed by Volcano Engine for high-performance reading and writing of PyTorch model files.☆26Jan 2, 2025Updated last year
- Prototypes and experiments for WG Device Management.☆15May 21, 2026Updated 3 weeks ago
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing☆12Apr 1, 2020Updated 6 years ago
- Node.js binding for PyTorch.☆18Jun 27, 2021Updated 4 years ago
- An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more☆889Jun 4, 2026Updated last week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- 🧘 Extensive LLM endpoints, expended capabilities through your favorite protocols, 🕸️ GraphQL, ↔️ gRPC, ♾️ WebSocket. Extended SOTA supp…☆20Jun 4, 2026Updated last week
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I…☆175Updated this week
- Helm Chart for Deploying Red Hat Developer Hub (Backstage). Community builds at https://redhat-developer.github.io/rhdh-chart/. Downstrea…☆26Updated this week
- Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively.☆202Mar 24, 2022Updated 4 years ago
- ☆17Jun 4, 2026Updated last week
- KServe V2 Protocol Rest API Implementation Proxy☆14Mar 19, 2026Updated 2 months ago
- JobSet: a k8s native API for distributed ML training and HPC workloads☆326Updated this week
- Example DRA driver that developers can fork and modify to get them started writing their own.☆131Updated this week
- Docker for Your ML/DL Models Based on OCI Artifacts☆473Jan 26, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆10,733Updated this week
- A toolkit to run Ray applications on Kubernetes☆2,534Jun 4, 2026Updated last week
- Archived Cloud Pak for AIOps GitOps☆11Sep 17, 2025Updated 8 months ago
- This repository contains the code developed for the talk "AI at the Edge with MicroShift" developed by Miguel Angel Ajo and Ricardo Norie…☆16Nov 30, 2023Updated 2 years ago
- GPU Environment Management for Visual Studio Code☆39Jul 19, 2023Updated 2 years ago
- Github integration with Knative Eventing.☆21Jun 2, 2026Updated last week
- This repo contains the follow-along student instructions for the lab. https://rhoai-mlops.github.io/lab-instructions/☆15Jun 4, 2026Updated last week
- Provides deploy scripts and CSI for Lustre.☆14Apr 13, 2026Updated last month
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆13May 16, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Repository for open inference protocol specification☆72May 12, 2025Updated last year
- Cloud Native ML/DL Platform☆132Sep 9, 2020Updated 5 years ago
- LoRAFusion: Efficient LoRA Fine-Tuning for LLMs☆27Apr 8, 2026Updated 2 months ago
- ☆14May 27, 2026Updated 2 weeks ago
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…☆513Updated this week
- Log macro for logs kv-unstable backend☆21Feb 23, 2021Updated 5 years ago
- Cloud Native Machine Learning Model Registry☆81Jan 12, 2023Updated 3 years ago