☆23 · Mar 16, 2026 · Updated last week
Alternatives and similar repositories for pytorch-micro-benchmarking
Users who are interested in pytorch-micro-benchmarking are comparing it to the libraries listed below.
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high… ☆97 · Updated this week
- ☆38 · Updated this week
- HPCG benchmark based on ROCm platform ☆39 · Mar 11, 2026 · Updated last week
- OpenACC* to OpenMP* API assisting migration tool ☆41 · Dec 15, 2025 · Updated 3 months ago
- This is the open source version of HPL-MXP. The code performance has been verified on Frontier ☆18 · Jul 9, 2025 · Updated 8 months ago
- ☆30 · Mar 2, 2026 · Updated 3 weeks ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code ☆15 · Mar 19, 2023 · Updated 3 years ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆26 · Feb 26, 2026 · Updated 3 weeks ago
- ☆72 · Updated this week
- ROCm Documentation Python package for ReadTheDocs build standardization ☆17 · Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆150 · Updated this week
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs ☆94 · Updated this week
- High Performance Linpack for Next-Generation AMD HPC Accelerators ☆68 · Dec 10, 2025 · Updated 3 months ago
- This repository contains the results and code for the MLPerf™ Training v1.0 benchmark. ☆36 · Feb 23, 2024 · Updated 2 years ago
- ☆16 · Nov 11, 2025 · Updated 4 months ago
- A benchmarking suite for heterogeneous systems. The primary goal of this project is to improve and update aspects of existing benchmarkin… ☆43 · Jan 30, 2026 · Updated last month
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆29 · Mar 18, 2026 · Updated last week
- ☆12 · May 30, 2025 · Updated 9 months ago
- ☆24 · Mar 5, 2026 · Updated 2 weeks ago
- ☆19 · Jan 17, 2024 · Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆117 · Updated this week
- Benchmarks ☆18 · Apr 28, 2025 · Updated 10 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆113 · Updated this week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N… ☆12 · Jun 24, 2024 · Updated last year
- PyTorch code examples for measuring the performance of collective communication calls in AI workloads ☆19 · Sep 18, 2025 · Updated 6 months ago
- This is the public repo for the MLPerf DeepCAM climate data segmentation proposal. ☆16 · Sep 30, 2025 · Updated 5 months ago
- WIP: Get Stable Diffusion ControlNet running with DirectML via ONNX ☆16 · Mar 13, 2023 · Updated 3 years ago
- ☆64 · Updated this week
- Unit Scaling demo and experimentation code ☆16 · Mar 12, 2024 · Updated 2 years ago
- Repository with examples and exercises for OLCF and AMD's HIP training series ☆17 · Oct 16, 2023 · Updated 2 years ago
- Python package of rocm-smi-lib ☆24 · Dec 15, 2025 · Updated 3 months ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras ☆22 · Feb 7, 2024 · Updated 2 years ago
- ☆47 · Nov 3, 2025 · Updated 4 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆88 · Mar 5, 2026 · Updated 2 weeks ago
- Intel® SHMEM - Device initiated shared memory based communication library ☆32 · Nov 12, 2025 · Updated 4 months ago
- ☆80 · Mar 18, 2026 · Updated last week
- The ROCdebug-agent is a library that can be loaded by ROCm Platform Runtime to provide some debugging functionality. ☆32 · Feb 27, 2026 · Updated 3 weeks ago
- A PyTorch native platform for training generative AI models ☆16 · Nov 18, 2025 · Updated 4 months ago
- Development repository for the Triton language and compiler ☆143 · Updated this week