MLPerf Client is a benchmark for Windows, Linux and macOS, focusing on client form factors in ML inference scenarios.
☆83Apr 10, 2026Updated this week
Alternatives and similar repositories for mlperf_client
Users that are interested in mlperf_client are comparing it to the libraries listed below.
- .NET application for stable diffusion, Leveraging OnnxStack, Amuse seamlessly integrates many StableDiffusion capabilities all within the…☆23Dec 29, 2023Updated 2 years ago
- A package for wrapping iterative MLJ models in a control strategy☆12Nov 6, 2025Updated 5 months ago
- Parser for lspci output on remote machines☆16Jun 10, 2021Updated 4 years ago
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"☆29Dec 18, 2024Updated last year
- Tool for exporting Apple Neural Engine-accelerated versions of transformers models on HuggingFace Hub.☆15Mar 30, 2026Updated 2 weeks ago
- Demos for my Talk at .NET Day Switzerland☆13Aug 30, 2024Updated last year
- Source code for Activated LoRA☆25Nov 22, 2025Updated 4 months ago
- PSAS webpage☆17Mar 26, 2021Updated 5 years ago
- Some sample caffemodel, prototxt, test images and precompiled loadables.☆13Apr 30, 2021Updated 4 years ago
- See the device (CPU/GPU/ANE) and estimated cost for every layer in your CoreML model.☆25Oct 23, 2025Updated 5 months ago
- Hexagon disassembler code generator for Rizin from the LLVM definitions.☆19Mar 19, 2026Updated 3 weeks ago
- Curated collection of AI inference engineering resources — LLM serving, GPU kernels, quantization, distributed inference, and production …☆96Feb 4, 2026Updated 2 months ago
- ☆14Updated this week
- ☆11Dec 11, 2024Updated last year
- Condense source code for LLM analysis by extracting essential highlights, utilizing a simplified version of Paul Gauthier's repomap techn…☆14Mar 3, 2024Updated 2 years ago
- 💬 Use your GitHub repo's Issues as your own ChatGPT (yes, really!)☆13Apr 25, 2025Updated 11 months ago
- This repository implements the "Ralph" autonomous coding loop pattern, designed to be agnostic of the specific AI agent being used. Wheth…☆31Jan 7, 2026Updated 3 months ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆11Dec 13, 2023Updated 2 years ago
- PTS test suite based on SNIA's Solid State Storage Performance Test Specification☆20Sep 12, 2018Updated 7 years ago
- ☆54Oct 31, 2021Updated 4 years ago
- A plugin for SimonW's llm CLI that analyses diffs in a local git repository, generates commit messages in an interactive prompt, and commits☆20Jun 27, 2025Updated 9 months ago
- ☆13Nov 24, 2025Updated 4 months ago
- Fast and memory-efficient exact attention☆20Updated this week
- ☆17Nov 26, 2024Updated last year
- Semantic diff for markdown☆16Feb 20, 2015Updated 11 years ago
- JSON Compact Schema (JSON-CS)☆15Mar 19, 2025Updated last year
- Converts models from HuggingFace to CoreML format.☆30Nov 9, 2024Updated last year
- Notes and artifacts from the ONNX steering committee☆28Updated this week
- cursor logs with gpt-4o using litellm proxy☆14Sep 9, 2025Updated 7 months ago
- ☆19Jul 12, 2025Updated 9 months ago
- DIY Makecode Arcade console with Rp2040 Pico☆17Oct 3, 2023Updated 2 years ago
- A minimalist plugin that collapses Starlight's sidebars and expands the main content to full width, creating a distraction-free, fullscre…☆24Oct 4, 2025Updated 6 months ago
- Exercises from the book Algorithms in C, Parts 1-4: Fundamentals, Data Structures, Sorting, Searching, written by Robert Sedgewick☆25Oct 1, 2020Updated 5 years ago
- An in-memory compressed cache for gigabytes of data written in Go.☆19Feb 6, 2023Updated 3 years ago
- ☆13Updated this week
- ☆16Nov 24, 2025Updated 4 months ago
- An I/O benchmark for deep learning applications☆105Mar 18, 2026Updated 3 weeks ago
- EPOCH Input System Version 2☆10Jun 5, 2020Updated 5 years ago
- ☆32Dec 14, 2025Updated 4 months ago