ROCm / rocmProfileDataLinks
☆24Updated 3 weeks ago
Alternatives and similar repositories for rocmProfileData
Users that are interested in rocmProfileData are comparing it to the libraries listed below
Sorting:
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆95Updated 2 weeks ago
- amdgpu example code in hip/asm☆32Updated 2 weeks ago
- ☆36Updated this week
- rocWMMA☆114Updated this week
- RCCL Performance Benchmark Tests☆67Updated last week
- ☆61Updated 5 months ago
- Advanced Profiling and Analytics for AMD Hardware☆156Updated this week
- Bandwidth test for ROCm☆56Updated 2 weeks ago
- ROC profiler library. Profiling with perf-counters and derived metrics.☆147Updated last week
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆86Updated last week
- ☆146Updated this week
- An extension library of WMMA API (Tensor Core API)☆97Updated 10 months ago
- Test suite for probing the numerical behavior of NVIDIA tensor cores☆38Updated 10 months ago
- A CUTLASS implementation using SYCL☆23Updated this week
- Development repository for the Triton language and compiler☆122Updated this week
- ☆26Updated this week
- ☆20Updated last month
- OpenAI Triton backend for Intel® GPUs☆187Updated this week
- AI Tensor Engine for ROCm☆201Updated this week
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels☆18Updated last month
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆83Updated last week
- ☆96Updated last year
- Dissecting NVIDIA GPU Architecture☆95Updated 2 years ago
- Repository to host ROCm Developer Hub Notebook Tutorials☆11Updated 2 weeks ago
- Stretching GPU performance for GEMMs and tensor contractions.☆242Updated last week
- hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditiona…☆97Updated this week
- Ahead of Time (AOT) Triton Math Library☆64Updated last week
- ☆20Updated 2 months ago
- ROCm BLAS marshalling library☆142Updated this week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆61Updated 3 months ago