carlushuang / gcnasm
amdgpu example code in hip/asm
☆29Updated 2 months ago
Alternatives and similar repositories for gcnasm:
Users that are interested in gcnasm are comparing it to the libraries listed below
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆82Updated last week
- ☆94Updated 11 months ago
- Dissecting NVIDIA GPU Architecture☆90Updated 2 years ago
- ☆43Updated 4 years ago
- An extension library of WMMA API (Tensor Core API)☆95Updated 9 months ago
- ☆141Updated this week
- rocWMMA☆106Updated this week
- ☆61Updated 3 months ago
- Assembler for NVIDIA Volta and Turing GPUs☆216Updated 3 years ago
- Advanced Profiling and Analytics for AMD Hardware☆145Updated this week
- Implement asm gemm on vega64 for 4096x4096 fp32 matrix☆22Updated 5 years ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆71Updated this week
- ☆38Updated 5 years ago
- Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA☆32Updated 4 years ago
- development repository for the open earth compiler☆79Updated 4 years ago
- A intelligent matrix format designer for SpMV☆10Updated last year
- 14 basic topics for VEGA64 performance optmization☆54Updated 4 years ago
- collection of benchmarks to measure basic GPU capabilities☆354Updated 2 months ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆104Updated 7 years ago
- Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)