Systematic and comprehensive benchmarks for LLM systems.
☆51Jan 28, 2026Updated last month
Alternatives and similar repositories for LMBenchmark
Users who are interested in LMBenchmark compare it to the libraries listed below.
- ☆17May 28, 2024Updated last year
- A new memory mapping interface for efficient direct user-space access to byte-addressable storage, published in MICRO2022.☆15Sep 29, 2022Updated 3 years ago
- ☆63May 16, 2025Updated 10 months ago
- ☆152Oct 9, 2024Updated last year
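- https://xuruowei.com was created in her memory by her family, her friends, and her partner Gao Ce (高策). Xu Ruowei (徐若薇) passed away on February 28, 2026. We hope to commemorate her life through this timeline: photos, stories, writings, music, and everything she loved. Walk along the path of her life and revisit those moments of warmth.☆27Mar 2, 2026Updated 3 weeks ago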
- GeminiFS: A Companion File System for GPUs☆72Feb 18, 2025Updated last year
- ☆22Dec 25, 2025Updated 2 months ago
- A High-Throughput Multi-GPU System for Graph-Based Approximate Nearest Neighbor Search☆21Jul 22, 2025Updated 8 months ago
- ☆21Mar 15, 2026Updated last week
- Cloud Native Benchmarking of Foundation Models☆45Jul 31, 2025Updated 7 months ago
- Augmented Dickey-Fuller implementation in Go☆12Mar 15, 2019Updated 7 years ago
- [DEPRECATED] Prometheus exporter for VPA recommendations☆12Aug 22, 2023Updated 2 years ago
- COCCL: Compression and precision co-aware collective communication library☆30Mar 16, 2025Updated last year
- Artifacts of VLDB'22 paper "COMET: A Novel Memory-Efficient Deep Learning Training Framework by Using Error-Bounded Lossy Compression"☆10Aug 2, 2022Updated 3 years ago
- 🎙️ Retroactively fix your Zoom recordings with a click! Won 1st Place, Best Use of GCP, Best Start-Up, and Best Entrepreneurial Hack at …☆10Feb 10, 2022Updated 4 years ago
- Simple example for learning and serving 'MNIST' in kubernetes cluster☆10Mar 27, 2019Updated 6 years ago
- NVIDIA Inference Xfer Library (NIXL)☆945Updated this week
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆22Feb 7, 2024Updated 2 years ago
- KV cache store for distributed LLM inference☆399Nov 13, 2025Updated 4 months ago
- ☆13Apr 30, 2024Updated last year
- Nabla Containers blog☆12May 26, 2021Updated 4 years ago
- Supercharge Your LLM with the Fastest KV Cache Layer☆7,693Updated this week
- Source Code for Partial Interference☆10Dec 17, 2022Updated 3 years ago
- [WIP] Open Source WakaTime Server☆14Feb 4, 2019Updated 7 years ago
- ☆12Mar 26, 2024Updated last year
- ☆10Apr 7, 2020Updated 5 years ago
- A QEMU + gem5 co-simulation framework for AMD MI300X GPU research.☆29Updated this week
- Evaluation Suite for NVMe devices☆13Nov 14, 2024Updated last year
- ☆11Dec 20, 2024Updated last year
- llm-d benchmark scripts and tooling☆49Updated this week
- JUPITER Benchmark Suite☆23Jul 18, 2025Updated 8 months ago
- WIPE implementation☆13Nov 26, 2023Updated 2 years ago
- A portable implementation of SZ lossy compression for AMD GPUs and Hygon DCUs.☆10Feb 26, 2025Updated last year
- Fast SHA-256 that utilizes the GPU☆13Dec 17, 2021Updated 4 years ago
- ☆15Jan 21, 2023Updated 3 years ago
- Go Client for jAccount☆12Jul 18, 2025Updated 8 months ago
- Keyformer proposes KV Cache reduction through key tokens identification and without the need for fine-tuning☆57Mar 26, 2024Updated last year
- HDF5 Cache VOL connector for caching data on fast storage layers and moving data asynchronously to the parallel file system to hide I/O o…☆21Feb 10, 2026Updated last month
- ☆14Jun 4, 2024Updated last year