ROCm / hipBLASLtLinks
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆114Updated last week
Alternatives and similar repositories for hipBLASLt
Users that are interested in hipBLASLt are comparing it to the libraries listed below
Sorting:
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆148Updated this week
- Bandwidth test for ROCm☆72Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆138Updated this week
- RCCL Performance Benchmark Tests☆82Updated last week
- AI Tensor Engine for ROCm☆322Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆256Updated last week
- Development repository for the Triton language and compiler☆137Updated this week
- AMD's graph optimization engine.☆268Updated last week
- monorepo for rocm libraries☆211Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆84Updated last week
- ☆162Updated this week
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆123Updated last month
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆390Updated this week
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high…☆92Updated last week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆499Updated this week
- ROCm Communication Collectives Library (RCCL)☆405Updated this week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆367Updated this week
- ☆155Updated this week
- AMD SMI☆103Updated last week
- OpenAI Triton backend for Intel® GPUs☆222Updated this week
- AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming☆133Updated last week
- ☆130Updated last week
- Ahead of Time (AOT) Triton Math Library☆84Updated last week
- ☆54Updated this week
- super repo for rocm systems projects☆182Updated this week
- GPUOcelot: A dynamic compilation framework for PTX☆219Updated 10 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆154Updated last week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆269Updated this week
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆133Updated last week
- AMD’s C++ library for accelerating tensor primitives☆46Updated last week