amdgpu example code in hip/asm
☆64Jun 3, 2026Updated 2 weeks ago
Alternatives and similar repositories for gcnasm
Users that are interested in gcnasm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AI Tensor Engine for ROCm☆460Updated this week
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆137Apr 10, 2026Updated 2 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror☆537Updated this week
- Implement asm gemm on vega64 for 4096x4096 fp32 matrix☆22Oct 12, 2019Updated 6 years ago
- AiTer Optimized Model☆112Updated this week
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Super fast FP32 matrix multiplication on RDNA3☆90Mar 30, 2025Updated last year
- Zig regex experiment☆13Nov 6, 2025Updated 7 months ago
- Commands that will make you more comfortable with the ROCm toolkit.☆18Aug 1, 2024Updated last year
- ☆21Mar 22, 2021Updated 5 years ago
- Cute layout visualization☆40Jan 18, 2026Updated 5 months ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 4 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆146Jun 7, 2026Updated last week
- ☆18Mar 12, 2025Updated last year
- A collection of examples for the ROCm software stack☆294Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 14 basic topics for VEGA64 performance optmization☆66Mar 18, 2021Updated 5 years ago
- ☆169Dec 27, 2024Updated last year