ROCm/amd_matrix_instruction_calculator

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ROCm/amd_matrix_instruction_calculator)

ROCm / amd_matrix_instruction_calculator

A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators

☆133

Alternatives and similar repositories for amd_matrix_instruction_calculator

Users that are interested in amd_matrix_instruction_calculator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

carlushuang / gcnasm
View on GitHub
amdgpu example code in hip/asm
☆60Apr 22, 2026Updated 2 weeks ago
ROCm / rocprofiler-compute
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-systems repo
☆166Apr 14, 2026Updated 3 weeks ago
ROCm / roc-stdpar
View on GitHub
☆19Jan 17, 2024Updated 2 years ago
GPUOpen-Tools / isa_spec_manager
View on GitHub
Utilities for accessing AMD's Machine-Readable GPU ISA Specifications.
☆48Apr 9, 2026Updated 3 weeks ago
ROCm / composable_kernel
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror
☆529May 1, 2026Updated last week
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
AMDResearch / hpcfund
View on GitHub
AMD HPC Research Fund Cloud
☆19Apr 17, 2026Updated 3 weeks ago
ROCm / rocmProfileData
View on GitHub
☆30Apr 28, 2026Updated last week
ROCm / rocWMMA
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆140Updated this week
ROCm / iris
View on GitHub
AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming
☆188Updated this week
sunlex0717 / DissectingTensorCores
View on GitHub
☆114Apr 19, 2024Updated 2 years ago
ROCm / aiter
View on GitHub
AI Tensor Engine for ROCm
☆425Updated this week
ROCm / rocMLIR
View on GitHub
☆178Updated this week
JohndeVostok / APE
View on GitHub
A GPU FP32 computation method with Tensor Cores.
☆26Dec 8, 2025Updated 5 months ago
geobacter-rs / geobacter
View on GitHub
A single source Rust co-processor programming framework; runtime && Rust custom drivers
☆45Oct 6, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ROCm / rocBLAS-Examples
View on GitHub
Examples illustrating usage of the rocBLAS library
☆17Aug 12, 2024Updated last year
ROCm / rocPRIM
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆176Apr 29, 2026Updated last week
ROCm / Tensile
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆258Apr 30, 2026Updated last week
ROCm / rocSPARSE
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆135Apr 22, 2026Updated 2 weeks ago
ROCm / FlyDSL
View on GitHub
FlyDSL is the Python front‑end of the project: Flexible LaYout DSL.
☆179Updated this week
CRobeck / instrument-amdgpu-kernels
View on GitHub
LLVM/MLIR based compiler instrumentation of AMD GPU kernels
☆20Jul 13, 2025Updated 9 months ago
olcf / hip-training-series
View on GitHub
Repository with examples and exercises for OLCF and AMD's HIP training series
☆17Oct 16, 2023Updated 2 years ago
CHIP-SPV / chipStar
View on GitHub
chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.
☆326Apr 30, 2026Updated last week
ROCm / rocr_debug_agent
View on GitHub
The ROCdebug-agent is a library that can be loaded by ROCm Platform Runtime to provide some debugging functionality.
☆32Updated this week
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
graphcore-research / unit-scaling-demo
View on GitHub
Unit Scaling demo and experimentation code
☆16Mar 12, 2024Updated 2 years ago
ROCm / rocm-libraries
View on GitHub
super repo for rocm libraries
☆330Updated this week
ROCm / hipBLAS
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆152Apr 28, 2026Updated last week
amd / amd-lab-notes
View on GitHub
AMD lab notes with code examples to demonstrate use of AMD GPUs
☆113Jun 28, 2024Updated last year
ROCm / rocprofiler
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-systems repo
☆153Apr 14, 2026Updated 3 weeks ago
Snektron / exaregex
View on GitHub
Zig regex experiment
☆13Nov 6, 2025Updated 6 months ago
ROCm / rocHPL
View on GitHub
High Performance Linpack for Next-Generation AMD HPC Accelerators
☆71Apr 21, 2026Updated 2 weeks ago
ROCm / hipTensor
View on GitHub
AMD’s C++ library for accelerating tensor primitives
☆49Apr 30, 2026Updated last week
AMDComputeLibraries / ComputeApps
View on GitHub
Compute applications.
☆25Dec 12, 2019Updated 6 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
bollu / polymage
View on GitHub
PolyMage is a domain-specific language and optimizing code generator for auto-parallelisation
☆14Jul 15, 2016Updated 9 years ago
NVlabs / mixedproxy
View on GitHub
☆16Nov 14, 2023Updated 2 years ago
ROCm / rocm-examples
View on GitHub
A collection of examples for the ROCm software stack
☆289Updated this week
ROCm / libhipcxx
View on GitHub
The C++ Standard Library for your entire system.
☆27Apr 9, 2026Updated last month
RadeonFlow / RadeonFlow_Kernels
View on GitHub
Efficient implementation of DeepSeek Ops (Blockwise FP8 GEMM, MoE, and MLA) for AMD Instinct MI300X
☆77Feb 11, 2026Updated 2 months ago
ROCm / ROCR-Runtime
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-systems repo
☆272Updated this week
paranumal / hipBone
View on GitHub
☆17Apr 9, 2025Updated last year