Distributed Model Serving Framework
☆189 Apr 14, 2026 Updated 2 months ago
Alternatives and similar repositories for modelmesh
Users that are interested in modelmesh are comparing it to the libraries listed below.
- Controller for ModelMesh ☆245 Apr 14, 2026 Updated 2 months ago
- Unified runtime-adapter image of the sidecar containers which run in the modelmesh pods ☆24 Apr 14, 2026 Updated 2 months ago
- Backend server for envd ☆21 Dec 18, 2023 Updated 2 years ago
- Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes ☆5,631 Updated this week
- KServe models web UI ☆50 Jun 12, 2026 Updated 2 weeks ago
- User documentation for KServe. ☆112 Jun 22, 2026 Updated last week
- ModelMesh Performance Scripts, Dashboard and Pipelines ☆13 Apr 14, 2026 Updated 2 months ago
- KServe community docs for contributions and process ☆15 Jun 20, 2026 Updated last week
- A library developed by Volcano Engine for high-performance reading and writing of PyTorch model files. ☆26 Jan 2, 2025 Updated last year
- Prototypes and experiments for WG Device Management. ☆16 May 21, 2026 Updated last month
- Custom Scheduler to deploy ML models to TRTIS for GPU Sharing ☆12 Apr 1, 2020 Updated 6 years ago
- Node.js binding for PyTorch. ☆18 Jun 27, 2021 Updated 5 years ago
- An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more ☆890 Jun 24, 2026 Updated last week
- The kernel module management operator builds, signs and loads kernel modules on OpenShift. ☆31 Updated this week
- 🧘 Extensive LLM endpoints, expanded capabilities through your favorite protocols, 🕸️ GraphQL, ↔️ gRPC, ♾️ WebSocket. Extended SOTA supp… ☆20 Updated this week
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I… ☆173 Jun 24, 2026 Updated last week
- Helm Chart for Deploying Red Hat Developer Hub (Backstage). Community builds at https://redhat-developer.github.io/rhdh-chart/. Downstrea… ☆27 Updated this week
- Cloud-native way to provide elastic Jupyter Notebooks on Kubernetes. Run remote kernels, natively. ☆202 Mar 24, 2022 Updated 4 years ago
- A basic golang app with a travis pipeline that deploys into a k8s cluster using Argo-CD ☆14 Aug 24, 2018 Updated 7 years ago
- KServe V2 Protocol Rest API Implementation Proxy ☆14 Mar 19, 2026 Updated 3 months ago
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆329 Jun 23, 2026 Updated last week
- Example DRA driver that developers can fork and modify to get them started writing their own. ☆135 Updated this week
- Code for generating synthetic Japanese text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta… ☆13 Aug 30, 2019 Updated 6 years ago
- Docker for Your ML/DL Models Based on OCI Artifacts ☆473 Jan 26, 2024 Updated 2 years ago
- The Triton Inference Server provides an optimized cloud and edge inferencing solution. ☆10,776 Updated this week
- A toolkit to run Ray applications on Kubernetes ☆2,562 Updated this week
- This repository contains the code developed for the talk "AI at the Edge with MicroShift" developed by Miguel Angel Ajo and Ricardo Norie… ☆16 Nov 30, 2023 Updated 2 years ago
- GPU Environment Management for Visual Studio Code ☆39 Jul 19, 2023 Updated 2 years ago
- GitHub integration with Knative Eventing. ☆21 Jun 16, 2026 Updated 2 weeks ago
- This repo contains the follow-along student instructions for the lab. https://rhoai-mlops.github.io/lab-instructions/ ☆15 Jun 18, 2026 Updated last week
- Provides deploy scripts and CSI for Lustre. ☆14 Apr 13, 2026 Updated 2 months ago
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i… ☆13 May 16, 2023 Updated 3 years ago
- Repository for open inference protocol specification ☆74 May 12, 2025 Updated last year
- Cloud Native ML/DL Platform ☆132 Sep 9, 2020 Updated 5 years ago
- LoRAFusion: Efficient LoRA Fine-Tuning for LLMs ☆28 Apr 8, 2026 Updated 2 months ago
- ☆14 May 27, 2026 Updated last month
- Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv… ☆516 Updated this week
- Log macro for logs kv-unstable backend ☆22 Feb 23, 2021 Updated 5 years ago
- Cloud Native Machine Learning Model Registry ☆81 Jan 12, 2023 Updated 3 years ago