ROCm / rocBLASLinks
Next generation BLAS implementation for ROCm platform
☆381Updated this week
Alternatives and similar repositories for rocBLAS
Users that are interested in rocBLAS are comparing it to the libraries listed below
Sorting:
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆245Updated this week
- ROCm BLAS marshalling library☆144Updated this week
- Next generation FFT implementation for ROCm☆195Updated this week
- Examples for HIP☆208Updated 6 months ago
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime☆258Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆173Updated this week
- ROCm Communication Collectives Library (RCCL)☆342Updated this week
- AMD's graph optimization engine.☆223Updated this week
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 5 months ago
- rocWMMA☆115Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆121Updated this week
- hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditiona…☆103Updated this week
- ROCm Device Libraries☆97Updated last year
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆423Updated this week
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆404Updated 5 months ago
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific…☆156Updated this week
- Next generation LAPACK implementation for ROCm platform☆102Updated this week
- ☆140Updated this week
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas…☆224Updated this week
- A collection of examples for the ROCm software stack☆219Updated last week
- ROC profiler library. Profiling with perf-counters and derived metrics.☆148Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆120Updated last week
- Next generation SPARSE implementation for ROCm platform☆127Updated this week
- HIPIFY: Convert CUDA to Portable C++ Code☆587Updated this week
- MIOpenGEMM is now deprecated☆62Updated last year
- STREAM, for lots of devices written in many programming models☆343Updated 9 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆84Updated this week
- ROCm's Thunk Interface☆91Updated 3 months ago
- Intercept Layer for Debugging and Analyzing OpenCL Applications☆330Updated 2 weeks ago
- AI Tensor Engine for ROCm☆207Updated this week