☆23Mar 16, 2026Updated 3 months ago
Alternatives and similar repositories for pytorch-micro-benchmarking
Users that are interested in pytorch-micro-benchmarking are comparing it to the libraries listed below.
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high…☆105Jun 26, 2026Updated last week
- ☆40Jun 26, 2026Updated last week
- OpenACC* to OpenMP* API assisting migration tool☆41Dec 15, 2025Updated 6 months ago
- This is the open source version of HPL-MXP. The code performance has been verified on Frontier☆18Jul 9, 2025Updated 11 months ago
- ☆30Jun 16, 2026Updated 2 weeks ago
- Repository of machine learning benchmarks☆50Jun 8, 2026Updated 3 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆28May 28, 2026Updated last month
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆16Mar 19, 2023Updated 3 years ago
- ROCm Documentation Python package for ReadTheDocs build standardization☆16Updated this week
- ☆80Jun 26, 2026Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆151Jun 24, 2026Updated last week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆96Updated this week
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆73Apr 21, 2026Updated 2 months ago
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.☆36Feb 23, 2024Updated 2 years ago
- ☆17Nov 11, 2025Updated 7 months ago
- A benchmarking suite for heterogeneous systems. The primary goal of this project is to improve and update aspects of existing benchmarkin…☆44Jan 30, 2026Updated 5 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆29Jun 24, 2026Updated last week
- ☆15Jun 8, 2026Updated 3 weeks ago
- ☆24Mar 5, 2026Updated 3 months ago
- ☆20Jan 17, 2024Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆122Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆113Jun 26, 2026Updated last week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆12Jun 24, 2024Updated 2 years ago
- PyTorch code examples for measuring the performance of collective communication calls in AI workloads☆21Sep 18, 2025Updated 9 months ago
- Research work about learning to do tracking.☆13Jun 28, 2019Updated 7 years ago
- AA (bill-splitting) calculator☆16Jun 11, 2020Updated 6 years ago
- This is the public repo for the MLPerf DeepCAM climate data segmentation proposal.☆16Sep 30, 2025Updated 9 months ago
- WIP: Get Stable Diffusion ControlNet running with DirectML via ONNX☆16Mar 13, 2023Updated 3 years ago
- ☆71Jun 27, 2026Updated last week
- Repository with examples and exercises for OLCF and AMD's HIP training series☆17Oct 16, 2023Updated 2 years ago
- Python package of rocm-smi-lib☆25Dec 15, 2025Updated 6 months ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆22Feb 7, 2024Updated 2 years ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆92Jun 16, 2026Updated 2 weeks ago
- A PyTorch native platform for training generative AI models☆17Apr 21, 2026Updated 2 months ago
- Development repository for the Triton language and compiler☆145Jun 23, 2026Updated last week
- Dockerfiles for the various software layers defined in the ROCm software platform☆521Jan 27, 2026Updated 5 months ago
- TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.☆1,039Updated this week
- [ICLR2026] The code for "Interp3D: Correspondence-Aware Interpolation for Generative Textured 3D Morphing."☆31Jan 21, 2026Updated 5 months ago
- ☆62Apr 7, 2026Updated 2 months ago