ROCm / MIOpen
AMD's Machine Intelligence Library
☆1,049Updated this week
Related projects: ⓘ
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform☆428Updated 4 years ago
- Next generation BLAS implementation for ROCm platform☆341Updated this week
- HIP: C++ Heterogeneous-Compute Interface for Portability☆3,690Updated this week
- Tuned OpenCL BLAS☆1,046Updated 3 months ago
- nGraph has moved to OpenVINO☆1,355Updated 3 years ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,669Updated 11 months ago
- TensorFlow ROCm port☆687Updated this week
- Dockerfiles for the various software layers defined in the ROCm software platform☆420Updated 3 weeks ago
- HIPIFY: Convert CUDA to Portable C++ Code☆499Updated this week
- AMD ROCm™ Software - GitHub Home☆4,493Updated this week
- Assembler for NVIDIA Maxwell architecture☆940Updated last year
- Acceleration package for neural networks on multi-core CPUs☆1,671Updated 3 months ago
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆293Updated this week
- Compute Library for Deep Neural Networks (clDNN)☆574Updated last year
- oneAPI Deep Neural Network Library (oneDNN)☆3,579Updated this week
- a software library containing BLAS functions written in OpenCL☆839Updated last month
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆221Updated this week
- Open single and half precision gemm implementations☆364Updated last year
- Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices☆840Updated 2 months ago
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated 4 months ago
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.☆1,301Updated this week
- Stretching GPU performance for GEMMs and tensor contractions.☆212Updated this week
- Benchmarking Deep Learning operations on different hardware☆1,065Updated 3 years ago
- This is a Experimental version of OpenCL by AMD Research, we now recommend you to use The official BVLC Caffe OpenCL branch is over at …☆516Updated 6 years ago
- AMD's graph optimization engine.☆183Updated this week
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group☆437Updated 6 months ago
- pocl - Portable Computing Language☆911Updated this week
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,169Updated this week
- Low-precision matrix multiplication☆1,772Updated 7 months ago
- ☆594Updated this week