Model Express is a Rust-based component meant to be placed next to existing model inference systems to speed up their startup times and improve overall performance.
☆56May 1, 2026Updated this week
Alternatives and similar repositories for modelexpress
Users who are interested in modelexpress are comparing it to the libraries listed below.
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆12May 16, 2023Updated 2 years ago
- Prototypes and experiments for WG Device Management.☆15Apr 1, 2026Updated last month
- WG Serving☆35Mar 24, 2026Updated last month
- CPU DRA Driver☆47Apr 22, 2026Updated last week
- d.run website☆17Apr 20, 2026Updated last week
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆26Apr 24, 2025Updated last year
- Operator for the mutating admission webhook for ClusterResourceOverride☆19Apr 15, 2026Updated 2 weeks ago
- ☆15Mar 6, 2025Updated last year
- helm repo add daocloud https://daocloud.github.io/dce-charts-repackage/☆12Updated this week
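- A high-performance proxy component for the Kubernetes APIServer. It proxies the APIServer's List requests, while other request types are reverse-proxied directly to the native APIServer. CKube additionally supports pagination, search, and indexing. CKube is also 100% compatible with native kubectl and ku…☆19Sep 16, 2022Updated 3 years ago
- Mesos Best Practices Guide (Mesos Handbook)☆12Jan 1, 2018Updated 8 years ago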
- 🧘 Extensive LLM endpoints, expanded capabilities through your favorite protocols, 🕸️ GraphQL, ↔️ gRPC, ♾️ WebSocket. Extended SOTA supp…☆20Updated this week
- ☆142Apr 23, 2026Updated last week
- It is very easy to switch from Docker Shim to CRI Dockerd and back☆31Oct 30, 2023Updated 2 years ago
- caniuse.com, but for kubernetes☆27Dec 25, 2024Updated last year
- Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling☆198Updated this week
- ☆35Apr 20, 2026Updated last week
- AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solu…☆253Updated this week
- 🐳🧩 Easy to use MCP builder & launcher for all possible MCP servers, just like Ollama for models!☆41Apr 30, 2025Updated last year
- ☆13Apr 10, 2026Updated 3 weeks ago
- Enlightener, the cutting-edge Retrieval-Augmented Generation (RAG) system that revolutionizes query responses. By combining the power of …☆13Jul 28, 2025Updated 9 months ago
- Running and managing Wasm(actors) and capability providers in Kubernetes☆31Dec 12, 2023Updated 2 years ago
- A list of articles outside of the official MLIR docs that I've found useful for learning MLIR☆11Aug 16, 2023Updated 2 years ago
- Code snippets and reproductions from JustAByte☆45Apr 6, 2026Updated 3 weeks ago
- ☆49Updated this week
- A QA system based on k8s-specific knowledge, built on ChatGLM2-6B and served by Ray.☆10Sep 14, 2023Updated 2 years ago
- ☆13Jun 17, 2019Updated 6 years ago
- 2023 China Open Source Report☆16Mar 9, 2026Updated last month
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Mar 12, 2025Updated last year
- ☆15Nov 17, 2015Updated 10 years ago
- Exploring how optimizations for GEMMs work☆31Feb 28, 2026Updated 2 months ago
- GaussDB driver and toolkit for Go☆15Dec 17, 2025Updated 4 months ago
- Staging repo for CSI Migration/Translation libraries☆14Apr 26, 2026Updated last week
- AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVF…☆117Feb 15, 2026Updated 2 months ago
- This is a collection of the EMC storage platform drivers for ClusterHQ's Flocker☆12Oct 19, 2016Updated 9 years ago
- ☆16May 14, 2025Updated 11 months ago
- tee-like program that tee-s stdin to a rotated log file(s) and can compress them.☆15Jan 28, 2018Updated 8 years ago
- A very simple tool to rewrite parameters such as attributes and constants for OPs in ONNX models. Simple Attribute and Constant Modifier …☆15Feb 6, 2026Updated 2 months ago
- Command-line tools for managing OCI model artifacts, which are bundled based on Model Spec☆73Updated this week