A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators
☆129 · Apr 10, 2026 · Updated last week
Alternatives and similar repositories for amd_matrix_instruction_calculator
Users that are interested in amd_matrix_instruction_calculator are comparing it to the libraries listed below.
- amdgpu example code in hip/asm ☆58 · Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆165 · Mar 31, 2026 · Updated 2 weeks ago
- ☆19 · Jan 17, 2024 · Updated 2 years ago
- Utilities for accessing AMD's Machine-Readable GPU ISA Specifications. ☆47 · Apr 9, 2026 · Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror ☆526 · Apr 11, 2026 · Updated last week
- AMD HPC Research Fund Cloud ☆19 · Updated this week
- ☆30 · Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆140 · Updated this week
- AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming ☆183 · Updated this week
- ☆24 · May 9, 2025 · Updated 11 months ago
- AI Tensor Engine for ROCm ☆406 · Updated this week
- ☆113 · Apr 19, 2024 · Updated last year
- ☆174 · Updated this week
- A GPU FP32 computation method with Tensor Cores. ☆26 · Dec 8, 2025 · Updated 4 months ago
- A single source Rust co-processor programming framework; runtime && Rust custom drivers ☆45 · Oct 6, 2023 · Updated 2 years ago
- Examples illustrating usage of the rocBLAS library ☆17 · Aug 12, 2024 · Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆177 · Apr 8, 2026 · Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆255 · Apr 9, 2026 · Updated last week
- FlyDSL is the Python front‑end of the project: Flexible LaYout DSL. ☆155 · Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆135 · Updated this week
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels ☆20 · Jul 13, 2025 · Updated 9 months ago
- Repository with examples and exercises for OLCF and AMD's HIP training series ☆17 · Oct 16, 2023 · Updated 2 years ago
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs. ☆326 · Updated this week
- The ROCdebug-agent is a library that can be loaded by ROCm Platform Runtime to provide some debugging functionality. ☆32 · Updated this week
- super repo for rocm libraries ☆306 · Updated this week
- Unit Scaling demo and experimentation code ☆16 · Mar 12, 2024 · Updated 2 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆151 · Apr 7, 2026 · Updated last week
- AMD lab notes with code examples to demonstrate use of AMD GPUs ☆112 · Jun 28, 2024 · Updated last year
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆153 · Updated this week
- Zig regex experiment ☆13 · Nov 6, 2025 · Updated 5 months ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators ☆69 · Apr 10, 2026 · Updated last week
- AMD’s C++ library for accelerating tensor primitives ☆49 · Updated this week
- Compute applications. ☆25 · Dec 12, 2019 · Updated 6 years ago
- PolyMage is a domain-specific language and optimizing code generator for auto-parallelisation ☆14 · Jul 15, 2016 · Updated 9 years ago
- ☆15 · Nov 14, 2023 · Updated 2 years ago
- A collection of examples for the ROCm software stack ☆286 · Updated this week
- The C++ Standard Library for your entire system. ☆27 · Apr 9, 2026 · Updated last week
- Efficient implementation of DeepSeek Ops (Blockwise FP8 GEMM, MoE, and MLA) for AMD Instinct MI300X ☆76 · Feb 11, 2026 · Updated 2 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆270 · Updated this week