Model Express is a Rust-based component meant to be placed next to existing model inference systems to speed up their startup times and improve overall performance.
☆40Mar 20, 2026Updated this week
Alternatives and similar repositories for modelexpress
Users who are interested in modelexpress are comparing it to the libraries listed below.
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆12May 16, 2023Updated 2 years ago
- Prototypes and experiments for WG Device Management.☆15Mar 9, 2026Updated 2 weeks ago
- WG Serving☆34Mar 5, 2026Updated 2 weeks ago
- CPU DRA Driver☆35Mar 12, 2026Updated last week
- d.run website☆16Mar 13, 2026Updated last week
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆26Apr 24, 2025Updated 10 months ago
- Operator for the mutating admission webhook for ClusterResourceOverride☆18Mar 13, 2026Updated last week
- ☆15Mar 6, 2025Updated last year
- helm repo add daocloud https://daocloud.github.io/dce-charts-repackage/☆12Updated this week
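- A high-performance proxy component for the Kubernetes APIServer: it proxies the APIServer's List requests, while other request types are reverse-proxied directly to the native APIServer. CKube additionally supports pagination, search, and indexing. CKube is also 100% compatible with native kubectl and ku…☆19Sep 16, 2022Updated 3 years ago
- Mesos Best Practices Guide (Mesos Handbook)☆12Jan 1, 2018Updated 8 years ago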
- 🧘 Extensive LLM endpoints, expanded capabilities through your favorite protocols, 🕸️ GraphQL, ↔️ gRPC, ♾️ WebSocket. Extended SOTA supp…☆19Updated this week
- ☆135Mar 13, 2026Updated last week
- A Mechanistic View on Video Generation as World Models: State and Dynamics☆31Mar 9, 2026Updated 2 weeks ago
- Benchmark SGLang on SLURM☆22Updated this week
- It is very easy to switch from Docker Shim to CRI Dockerd and back☆31Oct 30, 2023Updated 2 years ago
- Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling☆168Updated this week
- CVPR2025-Multi-party Collaborative Attention Control for Image Customization☆16May 14, 2025Updated 10 months ago
- caniuse.com, but for kubernetes☆27Dec 25, 2024Updated last year
- [ICME-2022] Official implementations of Localizing Semantic Patches for Accelerating Image Classification☆16Jul 1, 2022Updated 3 years ago
- Manage Kubernetes node-level kernel tuning (using sysctl).☆30Nov 21, 2025Updated 4 months ago
- AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solu…☆182Updated this week
- ☆34Mar 1, 2026Updated 3 weeks ago
- Some parallel and lesser-known but good data structures☆23Nov 11, 2024Updated last year
- Code snippets and reproductions from JustAByte☆25Jan 25, 2026Updated last month
- AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVF…☆99Feb 15, 2026Updated last month
- Converging Computing Resources Like an Ocean☆22Jan 16, 2026Updated 2 months ago
- 🐳🧩 Easy to use MCP builder & launcher for all possible MCP servers, just like Ollama for models!☆39Apr 30, 2025Updated 10 months ago
- Enlightener, the cutting-edge Retrieval-Augmented Generation (RAG) system that revolutionizes query responses. By combining the power of …☆14Jul 28, 2025Updated 7 months ago
- ☆14Feb 21, 2026Updated last month
- LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations☆26May 21, 2025Updated 10 months ago
- My own code implementation of CMU15445☆26Mar 20, 2023Updated 3 years ago
- A list of articles outside of the official MLIR docs that I've found useful for learning MLIR☆11Aug 16, 2023Updated 2 years ago
- Running and managing Wasm(actors) and capability providers in Kubernetes☆31Dec 12, 2023Updated 2 years ago
- Experimental DRA driver bringing CNI closer to Kubernetes☆39Oct 1, 2025Updated 5 months ago
- ☆44Updated this week
- A SystemVerilog implementation of MIPS32 CPU and RIP router☆22Jan 12, 2020Updated 6 years ago
- AI voice assistant that uses Twilio Voice and ConversationRelay, and the Google Gemini API to engage in two-way conversations over a phon…☆25Feb 19, 2026Updated last month
- ☆48Dec 8, 2025Updated 3 months ago