ROCm/pytorch-micro-benchmarking

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ROCm/pytorch-micro-benchmarking)

ROCm / pytorch-micro-benchmarking

☆23

Alternatives and similar repositories for pytorch-micro-benchmarking

Users that are interested in pytorch-micro-benchmarking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ROCm / rocHPCG
View on GitHub
HPCG benchmark based on ROCm platform
☆41Updated this week
ROCm / hip-tests
View on GitHub
☆40Updated this week
ROCm / ROCmValidationSuite
View on GitHub
A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high…
☆107Updated this week
intel / intel-application-migration-tool-for-openacc-to-openmp
View on GitHub
OpenACC* to OpenMP* API assisting migration tool
☆41Dec 15, 2025Updated 7 months ago
ROCm / rocmProfileData
View on GitHub
☆30Jun 16, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ROCm / rocm-blogs
View on GitHub
☆81Updated this week
ROCm / rocprofiler-systems
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-systems repo
☆28May 28, 2026Updated last month
khaki3 / ptxas-wrapper
View on GitHub
A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code
☆16Mar 19, 2023Updated 3 years ago
ROCm / rocm-docs-core
View on GitHub
ROCm Documentation Python package for ReadTheDocs build standardization
☆16Updated this week
ROCm / hipBLAS
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆151Updated this week
EmbeddedLLM / vllm
View on GitHub
vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs
☆96Updated this week
mlcommons / training_results_v1.0
View on GitHub
This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.
☆36Feb 23, 2024Updated 2 years ago
ROCm / aws-ofi-rccl
View on GitHub
☆18Nov 11, 2025Updated 8 months ago
icl-utk-edu / hpl
View on GitHub
☆15Jul 25, 2021Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
argonne-lcf / SimAI-Bench
View on GitHub
ALCF benchmarks for coupled simulation and AI workflows
☆17Dec 11, 2025Updated 7 months ago
ROCm / roc-stdpar
View on GitHub
☆20Jan 17, 2024Updated 2 years ago
ROCm / hipBLASLt
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆114Updated this week
ROCm / hipify_torch
View on GitHub
☆25Mar 5, 2026Updated 4 months ago
IBM / pytorch-communication-benchmarks
View on GitHub
pytorch code examples for measuring the performance of collective communication calls in AI workloads
☆21Sep 18, 2025Updated 10 months ago
ROCm / AITemplate
View on GitHub
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…
☆12Jun 24, 2024Updated 2 years ago
lanl / benchmarks
View on GitHub
Benchmarks
☆20Jun 24, 2026Updated 3 weeks ago
tpof314 / AACalculator
View on GitHub
AA计算器
☆16Jun 11, 2020Updated 6 years ago
ROCm / pyrsmi
View on GitHub
python package of rocm-smi-lib
☆25Dec 15, 2025Updated 7 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ROCm / TransferBench
View on GitHub
TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)
☆74Updated this week
ROCm / rocprof-compute-viewer
View on GitHub
☆62Jul 16, 2026Updated last week
ROCm / rocHPL
View on GitHub
High Performance Linpack for Next-Generation AMD HPC Accelerators
☆73Apr 21, 2026Updated 3 months ago
csc-training / hip-programming
View on GitHub
☆15Jun 8, 2026Updated last month
NERSC / nersc-dl-multigpu
View on GitHub
single-GPU to multi-GPU training of PyTorch apps at NERSC
☆25Apr 23, 2026Updated 3 months ago
azrael417 / mlperf-deepcam
View on GitHub
This is the public repo for the MLPerf DeepCAM climate data segmentation proposal.
☆16Sep 30, 2025Updated 9 months ago
olcf / hip-training-series
View on GitHub
Repository with examples and exercises for OLCF and AMD's HIP training series
☆17Oct 16, 2023Updated 2 years ago
InfraWhisperer / llmtop
View on GitHub
htop for your LLM inference cluster
☆17May 11, 2026Updated 2 months ago
NERSC / intro-HPC-bootcamp-2023
View on GitHub
☆14Sep 7, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
graphcore-research / unit-scaling-demo
View on GitHub
Unit Scaling demo and experimentation code
☆16Mar 12, 2024Updated 2 years ago
mk1-project / quickreduce
View on GitHub
QuickReduce is a performant all-reduce library designed for AMD ROCm that supports inline compression.
☆38Aug 29, 2025Updated 10 months ago
ROCm / ATOM
View on GitHub
AiTer Optimized Model
☆142Updated this week
ROCm / vllm
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆122Updated this week
AMD-AGI / TraceLens
View on GitHub
Automating analysis from trace files
☆81Updated this week
sparticlesteve / cosmoflow-benchmark
View on GitHub
Benchmark implementation of CosmoFlow in TensorFlow Keras
☆22Feb 7, 2024Updated 2 years ago
ROCm / rccl-tests
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-systems repo
☆92Jul 14, 2026Updated last week