Model Express is a Rust-based component meant to be placed next to existing model inference systems to speed up their startup times and improve overall performance.
☆64May 22, 2026Updated this week
Alternatives and similar repositories for modelexpress
Users that are interested in modelexpress are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆12May 16, 2023Updated 3 years ago
- WG Serving☆35Mar 24, 2026Updated last month
- CPU DRA Driver☆49May 13, 2026Updated last week
- d.run website☆17May 13, 2026Updated last week
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆27Apr 24, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Operator for the mutating admission webhook for ClusterResourceOverride☆19Apr 15, 2026Updated last month
- ☆15Mar 6, 2025Updated last year
- helm repo add daocloud https://daocloud.github.io/dce-charts-repackage/☆12May 15, 2026Updated last week
- Kubernetes APIServer 高性能代理组件,代理 APIServer 的 List 请求,其它类型的请求会直接反向代理到原生 APIServer。 CKube 还额外支持了分页、搜索和索引等功能。 并且,CKube 100% 兼容原生 kubectl 和 ku…☆19Sep 16, 2022Updated 3 years ago
- 🧘 Extensive LLM endpoints, expended capabilities through your favorite protocols, 🕸️ GraphQL, ↔️ gRPC, ♾️ WebSocket. Extended SOTA supp…☆20Updated this week
- ☆143May 8, 2026Updated 2 weeks ago
- It is very easy to switch from Docker Shim to CRI Dockerd and back☆31Oct 30, 2023Updated 2 years ago
- caniuse.com, but for kubernetes☆27Dec 25, 2024Updated last year
- CVPR2025-Multi-party Collaborative Attention Control for Image Customization☆17May 14, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling☆209Updated this week
- Manage kubernetes node-level kernel tuning ( using sysctl ).☆30Nov 21, 2025Updated 6 months ago
- ☆36Apr 30, 2026Updated 3 weeks ago
- 🐳🧩 Easy to use MCP builder & launcher for all possible MCP servers, just like Ollama for models!☆42Apr 30, 2025Updated last year
- AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solu…☆320Updated this week
- Running and managing Wasm(actors) and capability providers in Kubernetes☆32Dec 12, 2023Updated 2 years ago
- Experimental DRA driver bringing CNI closer to Kubernetes☆43Oct 1, 2025Updated 7 months ago
- A list of articles outside of the official MLIR docs that I've found useful for learning MLIR☆11Aug 16, 2023Updated 2 years ago
- ☆52Mar 25, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- AI voice assistant that uses Twilio Voice and ConversationRelay, and the Google Gemini API to engage in two-way conversations over a phon…☆28Feb 19, 2026Updated 3 months ago
- ☆50May 13, 2026Updated last week
- A QA system based on k8s-specific knowledge build on ChatGLM2-6B, serving by Ray.☆10Sep 14, 2023Updated 2 years ago
- ☆13Jun 17, 2019Updated 6 years ago
- 2023 中国开源年度报告;2023 China Open Source Report☆16Mar 9, 2026Updated 2 months ago
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Mar 12, 2025Updated last year
- ☆15Nov 17, 2015Updated 10 years ago
- Exploring how optimizations for GEMMs work☆33Feb 28, 2026Updated 2 months ago
- Mojo Miji | A guide to Mojo programming language from a Pythonista's perspective | Mojo 秘籍☆30May 7, 2026Updated 2 weeks ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Simplified Data Management and Sharing for Kubernetes☆18May 13, 2026Updated last week
- GaussDB driver and toolkit for Go☆15Dec 17, 2025Updated 5 months ago
- Octopus is a neural machine generation toolkit for Arabic Natural Lnagauge Generation (NLG)☆10Apr 29, 2024Updated 2 years ago
- Pytorch implementation of the paper: Zero-Reference Deep Curve Estimation for Low-Light Image Enhancement.☆10Oct 17, 2020Updated 5 years ago
- AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVF…☆120Feb 15, 2026Updated 3 months ago
- This is a collection of the EMC storage platform drivers for ClusterHQ's Flocker☆12Oct 19, 2016Updated 9 years ago
- ☆16May 14, 2025Updated last year