sgl-project / ome
OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs)
☆77 Updated this week
Alternatives and similar repositories for ome
Users that are interested in ome are comparing it to the libraries listed below
- ☆28 Updated 2 months ago
- A collection of reproducible inference engine benchmarks ☆31 Updated 2 months ago
- JaxPP is a library for JAX that enables flexible MPMD pipeline parallelism for large-scale LLM training ☆49 Updated last month
- ☆55 Updated 9 months ago
- The driver for LMCache core to run in vLLM ☆42 Updated 4 months ago
- ☆37 Updated this week
- Write a fast kernel and run it on Discord. See how you compare against the best! ☆46 Updated this week
- TritonParse is a tool designed to help developers analyze and debug Triton kernels by visualizing the compilation process and source code… ☆93 Updated last week
- ☆26 Updated 3 months ago
- A TUI-based utility for real-time monitoring of InfiniBand traffic and performance metrics on the local node ☆24 Updated last month
- Home for OctoML PyTorch Profiler ☆113 Updated 2 years ago
- Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serv… ☆112 Updated this week
- TORCH_LOGS parser for PT2 ☆43 Updated last month
- Cloud Native Benchmarking of Foundation Models ☆38 Updated 2 weeks ago
- ☆62 Updated 4 months ago
- TensorRT LLM Benchmark Configuration ☆13 Updated 11 months ago
- Extensible collectives library in Triton ☆86 Updated 2 months ago
- Systematic and comprehensive benchmarks for LLM systems. ☆17 Updated last week
- ☆28 Updated 5 months ago
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments. ☆28 Updated 3 months ago
- ☆35 Updated last month
- An experimental CPU backend for Triton (https://github.com/openai/triton) ☆43 Updated 3 months ago
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate. ☆170 Updated this week
- Make Triton easier ☆46 Updated last year
- High-performance safetensors model loader ☆40 Updated this week
- ☆37 Updated 6 months ago
- Benchmark suite for LLMs from Fireworks.ai ☆76 Updated 3 weeks ago
- DeeperGEMM: crazy optimized version ☆69 Updated last month
- ☆81 Updated 7 months ago
- ☆47 Updated last year