ROCm / MIOpen
AMD's Machine Intelligence Library
☆1,130Updated this week
Alternatives and similar repositories for MIOpen:
Users that are interested in MIOpen are comparing it to the libraries listed below
- Next generation BLAS implementation for ROCm platform☆362Updated this week
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform☆437Updated 4 years ago
- Tuned OpenCL BLAS☆1,090Updated 4 months ago
- Stretching GPU performance for GEMMs and tensor contractions.☆233Updated last week
- AMD's graph optimization engine.☆213Updated this week
- HIP: C++ Heterogeneous-Compute Interface for Portability☆3,940Updated this week
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆222Updated this week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆365Updated this week
- a software library containing BLAS functions written in OpenCL☆852Updated 7 months ago
- HIPIFY: Convert CUDA to Portable C++ Code☆564Updated this week
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated 10 months ago
- This is a Experimental version of OpenCL by AMD Research, we now recommend you to use The official BVLC Caffe OpenCL branch is over at …☆521Updated 6 years ago
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime☆237Updated this week
- Dockerfiles for the various software layers defined in the ROCm software platform☆453Updated last month
- TensorFlow ROCm port☆689Updated this week
- AMD ROCm™ Software - GitHub Home☆5,104Updated this week
- pocl - Portable Computing Language☆970Updated this week
- A collection of examples for the ROCm software stack☆191Updated last week
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.☆1,471Updated this week
- cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it☆526Updated this week
- ☆250Updated this week
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group☆441Updated 4 months ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,735Updated last year
- Compute Library for Deep Neural Networks (clDNN)☆574Updated 2 years ago
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆389Updated 2 months ago
- Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices☆857Updated 9 months ago
- Open single and half precision gemm implementations☆378Updated last year
- ☆639Updated this week
- ROCm Communication Collectives Library (RCCL)☆305Updated this week
- Examples for HIP☆203Updated 3 months ago