mlcommons / mlperf_client
MLPerf Client is a benchmark for Windows, Linux and macOS, focusing on client form factors in ML inference scenarios.
☆72 · Updated 2 months ago
Alternatives and similar repositories for mlperf_client
Users who are interested in mlperf_client are comparing it to the libraries listed below.
- No-code CLI designed for accelerating ONNX workflows ☆226 · Updated 7 months ago
- AMD related optimizations for transformer models ☆97 · Updated 3 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs ☆93 · Updated this week
- Repository of model demos using TT-Buda ☆63 · Updated 9 months ago
- Train, tune, and infer Bamba model ☆138 · Updated 7 months ago
- Transformer GPU VRAM estimator ☆67 · Updated last year
- Tenstorrent console based hardware information program ☆58 · Updated this week
- LLM inference in C/C++ ☆104 · Updated last month
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools ☆40 · Updated 6 months ago
- A collection of all available inference solutions for the LLMs ☆94 · Updated 10 months ago
- ☆117 · Updated 3 weeks ago
- Intel® AI Super Builder ☆153 · Updated this week
- High-Performance FP32 GEMM on CUDA devices ☆117 · Updated last year
- This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang ☆100 · Updated last week
- LLM training in simple, raw C/HIP for AMD GPUs ☆57 · Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆113 · Updated last week
- AMD SMI ☆113 · Updated last week
- Benchmark and optimize LLM inference across frameworks with ease ☆158 · Updated 4 months ago
- Onboarding documentation source for the AMD Ryzen™ AI Software Platform. The AMD Ryzen™ AI Software Platform enables developers to take… ☆91 · Updated last week
- Prepare for DeepSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code. ☆74 · Updated 11 months ago
- A minimalistic C++ Jinja templating engine for LLM chat templates ☆202 · Updated 4 months ago
- ☆219 · Updated last year
- This repository contains Dockerfiles, scripts, yaml files, Helm charts, etc. used to scale out AI containers with versions of TensorFlow … ☆59 · Updated last week
- AI Tensor Engine for ROCm ☆344 · Updated last week
- ☆135 · Updated last week
- Fully Open Language Models with Stellar Performance ☆317 · Updated 2 months ago
- LLM training in simple, raw C/CUDA ☆112 · Updated last year
- Inference server benchmarking tool ☆141 · Updated 3 months ago
- Cray-LM unified training and inference stack. ☆22 · Updated last year
- Measuring Thinking Efficiency in Reasoning Models - Research Repository ☆38 · Updated last month