Systematic and comprehensive benchmarks for LLM systems.
☆57Jan 28, 2026Updated 3 months ago
Alternatives and similar repositories for LMBenchmark
Users that are interested in LMBenchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18May 28, 2024Updated last year
- A new memory mapping interface for efficient direct user-space access to byte-addressable storage, published in MICRO2022.☆15Sep 29, 2022Updated 3 years ago
- ☆63May 16, 2025Updated 11 months ago
- ☆158Oct 9, 2024Updated last year
- LMCache on Ascend☆66Apr 9, 2026Updated 3 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- https://xuruowei.com 是她的家人朋友们和她的爱人高策为纪念她留下的。徐若薇于 2026 年 2 月 28 日离世。我们希望通过这个时间线纪念她的一生——照片、故事、文字、音乐与她钟爱的一切。沿着她生命的轨迹漫步,重新触摸那些有温度的瞬间。☆28Apr 1, 2026Updated last month
- GeminiFS: A Companion File System for GPUs☆73Feb 18, 2025Updated last year
- Canary release with helm (Deprecated since compass v2.8)☆13Sep 28, 2020Updated 5 years ago
- Preview Code for Continuum Paper☆71Apr 13, 2026Updated 2 weeks ago
- A High-Throughput Multi-GPU System for Graph-Based Approximate Nearest Neighbor Search☆21Jul 22, 2025Updated 9 months ago
- KV cache store for distributed LLM inference☆410Nov 13, 2025Updated 5 months ago
- ☆26Apr 22, 2026Updated last week
- ☆73Jan 18, 2026Updated 3 months ago
- NVIDIA Inference Xfer Library (NIXL)☆1,011Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Incubating P/D sidecar for llm-d☆17Nov 13, 2025Updated 5 months ago
- Cloud Native Benchmarking of Foundation Models☆45Jul 31, 2025Updated 9 months ago
- MLCommons Science benchmarking working group☆13Apr 17, 2026Updated 2 weeks ago
- Supercharge Your LLM with the Fastest KV Cache Layer☆8,132Updated this week
- [DEPRECATED] Prometheus exporter for VPA recommendations☆12Aug 22, 2023Updated 2 years ago
- COCCL: Compression and precision co-aware collective communication library☆30Mar 16, 2025Updated last year
- Artifacts of VLDB'22 paper "COMET: A Novel Memory-Efficient Deep Learning TrainingFramework by Using Error-Bounded Lossy Compression"☆10Aug 2, 2022Updated 3 years ago
- 🎙️ Retroactively fix your Zoom recordings with a click! Won 1st Place, Best Use of GCP, Best Start-Up, and Best Entrepreneurial Hack at …☆10Feb 10, 2022Updated 4 years ago
- An experimental tool to modify YAMLs without losing (most of) comment lines.☆16Sep 25, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆22Feb 7, 2024Updated 2 years ago
- ☆13Apr 30, 2024Updated 2 years ago
- Nabla Containers blog☆12May 26, 2021Updated 4 years ago
- Deduplication over dis-aggregated memory for Serverless Computing☆14Mar 21, 2022Updated 4 years ago
- Source Code for Partial Interference☆10Dec 17, 2022Updated 3 years ago
- [WIP] Open Source WakaTime Server☆14Feb 4, 2019Updated 7 years ago
- ☆12Mar 26, 2024Updated 2 years ago
- vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization☆2,312Updated this week
- ☆10Apr 7, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- This website contains the python code accompanying the book "Mathematical Foundations of Deep Learning Models and Algorithms" by Konstant…☆55Nov 24, 2025Updated 5 months ago
- a QEMU + gem5 co-simulation framework for AMD MI300X GPU research.☆44Apr 19, 2026Updated 2 weeks ago
- Evaluation Suite for NVMe devices☆14Nov 14, 2024Updated last year
- llm-d benchmark scripts and tooling☆58Apr 25, 2026Updated last week
- WIPE implementation☆13Nov 26, 2023Updated 2 years ago
- JUPITER Benchmark Suite☆24Jul 18, 2025Updated 9 months ago
- A portable implementation of SZ lossy compression for AMD GPUs and Hygon DCUs.☆10Feb 26, 2025Updated last year