Engine-agnostic LLM gateway in Rust. Full OpenAI & Anthropic API compatibility across vLLM, SGLang, TRT-LLM, OpenAI, Gemini & more. Industry-first gRPC pipeline, KV cache-aware routing, chat history, tokenization caching, Responses API, embeddings, WASM plugins, MCP, and multi-tenant auth.
☆108Mar 19, 2026Updated this week
Alternatives and similar repositories for smg
Users that are interested in smg are comparing it to the libraries listed below
Sorting:
- Incubating P/D sidecar for llm-d☆16Nov 13, 2025Updated 4 months ago
- This CLI tool and Python3 module collects the current system state for documentation☆22Updated this week
- The living Trust and Safety User Guide for the AI Alliance (https://thealliance.ai)☆15Feb 15, 2026Updated last month
- Kubernetes CSI Driver for serving OCI model artifacts☆24Mar 11, 2026Updated last week
- A userspace filesystem backing by Apache OpenDAL.☆37Jan 8, 2026Updated 2 months ago
- HeraclesQL is a Python DSL for writing alerts!☆26Dec 3, 2025Updated 3 months ago
- Mini-pytorch implemented from scratch using Python☆14Jan 19, 2022Updated 4 years ago
- ☆41Aug 21, 2025Updated 6 months ago
- /j f t/ - YAML file tool☆13Feb 9, 2026Updated last month
- High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)☆29Jan 22, 2026Updated last month
- Expert Specialization MoE Solution based on CUTLASS☆27Jan 19, 2026Updated 2 months ago
- SGLang is a fast serving framework for large language models and vision language models.☆20Updated this week
- IBM Z Deep Neural Network Library (zDNN) provides an interface for applications making use of Neural Network Processing Assist Facility (…☆20Sep 17, 2025Updated 6 months ago
- A similarity measurer on two programming assignments on Online Judge.☆10Jan 6, 2023Updated 3 years ago
- A storage plugin that provided CRI-O/Podman with the ability to lazy mount nydus images.☆42May 12, 2025Updated 10 months ago
- A Triton JIT runtime and ffi provider in C++☆32Updated this week
- Userscripts and userstyles with quality of life improvements for Bitbucket, Jira, and Confluence☆27Mar 13, 2026Updated last week
- Resource Exporter for volcano scheduling, e.g. NUMA-Aware scheduling.☆19May 30, 2025Updated 9 months ago
- a simple traceroute tool for iOS☆14Oct 25, 2017Updated 8 years ago
- 能够远程办公(work from home)的公司名单☆16Mar 2, 2022Updated 4 years ago
- A Rust reimplementation of genai-bench for benchmarking LLM serving systems at high concurrency with accurate timing and industry-standar…☆279Updated this week
- SCU Virtual Judge☆11Feb 16, 2023Updated 3 years ago
- Adapted iPerf3 iOS sample☆12Mar 15, 2017Updated 9 years ago
- Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, T…☆397Updated this week
- Workflow Defined Engine☆25Nov 4, 2025Updated 4 months ago
- Linux for Powerpc mirror☆27Updated this week
- Fast and memory-efficient exact attention☆20Mar 13, 2026Updated last week
- Deploy Kubernetes Using OpenStack Ironic☆11Jul 27, 2017Updated 8 years ago
- Codebase for Cuda Learning☆31Jul 13, 2024Updated last year
- KV cache store for distributed LLM inference☆399Nov 13, 2025Updated 4 months ago
- LLVM compile-tracking tracking infrastructure☆44Feb 22, 2026Updated 3 weeks ago
- RaptorJIT: a dynamic system programming language (manuscript)☆16Jun 4, 2019Updated 6 years ago
- Detect and remove unused dependencies for Python projects☆18Apr 5, 2025Updated 11 months ago
- Cloyster HPC is a turnkey HPC cluster solution with an user-friendly installer☆10Oct 2, 2025Updated 5 months ago
- A distributed system for Agentic AI☆49Updated this week
- Lustre Repository with MS patches☆15Mar 12, 2026Updated last week
- An OS kernel module for fast **remote** fork using advanced datacenter networking (RDMA).☆71Feb 15, 2025Updated last year
- A lightweight design for computation-communication overlap.☆225Jan 20, 2026Updated 2 months ago
- Auto detection of apt proxies in the LAN, caching and checking status☆10Feb 13, 2025Updated last year