mlcommons / mlperf_clientLinks
MLPerf Client is a benchmark for Windows and macOS, focusing on client form factors in ML inference scenarios.
☆43Updated last week
Alternatives and similar repositories for mlperf_client
Users that are interested in mlperf_client are comparing it to the libraries listed below
Sorting:
- No-code CLI designed for accelerating ONNX workflows☆207Updated last month
- ☆293Updated this week
- LLM inference in C/C++☆98Updated last week
- ☆102Updated 11 months ago
- TPI-LLM: Serving 70b-scale LLMs Efficiently on Low-resource Edge Devices☆186Updated 2 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆63Updated 11 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆22Updated last year
- ☆102Updated last month
- OpenVINO Tokenizers extension☆38Updated this week
- NextCoder: Robust Adaptation of Code LMs to Diverse Code Edits (ICML'25)☆24Updated last month
- Utils for Unsloth☆122Updated this week
- ☆14Updated 2 weeks ago
- Run Generative AI models with simple C++/Python API and using OpenVINO Runtime☆316Updated this week
- AMD related optimizations for transformer models☆81Updated last month
- The NVIDIA RTX™ AI Toolkit is a suite of tools and SDKs for Windows developers to customize, optimize, and deploy AI models across RTX PC…☆168Updated 8 months ago
- ☆44Updated last month
- Nexusflow function call, tool use, and agent benchmarks.☆28Updated 7 months ago
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…☆43Updated 10 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆39Updated 3 weeks ago
- A collection of all available inference solutions for the LLMs☆91Updated 5 months ago
- Running Microsoft's BitNet via Electron, React & Astro☆43Updated 2 months ago
- llama.cpp fork used by GPT4All☆56Updated 5 months ago
- A minimalistic C++ Jinja templating engine for LLM chat templates☆164Updated this week
- Train, tune, and infer Bamba model☆131Updated 2 months ago
- A composite model for simulating integrated nearshore and aeolian sediment transport.☆7Updated 8 years ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆82Updated last year
- cli tool to quantize gguf, gptq, awq, hqq and exl2 models☆74Updated 7 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 8 months ago
- ☆17Updated 7 months ago
- CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning☆131Updated this week