ROCm / MIOpen
AMD's Machine Intelligence Library
☆1,117Updated this week
Alternatives and similar repositories for MIOpen:
Users that are interested in MIOpen are comparing it to the libraries listed below
- Next generation BLAS implementation for ROCm platform☆359Updated this week
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform☆434Updated 4 years ago
- nGraph has moved to OpenVINO☆1,349Updated 4 years ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆221Updated this week
- AMD's graph optimization engine.☆208Updated this week
- HIPIFY: Convert CUDA to Portable C++ Code☆552Updated this week
- Stretching GPU performance for GEMMs and tensor contractions.☆233Updated this week
- Dockerfiles for the various software layers defined in the ROCm software platform☆451Updated this week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆349Updated this week
- AMD ROCm™ Software - GitHub Home☆4,980Updated this week
- Tuned OpenCL BLAS☆1,084Updated 3 months ago
- HIP: C++ Heterogeneous-Compute Interface for Portability☆3,896Updated this week
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated 9 months ago
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.☆1,424Updated this week
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime☆235Updated this week
- Benchmarking Deep Learning operations on different hardware☆1,081Updated 3 years ago
- Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices☆853Updated 8 months ago
- MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into …☆190Updated this week
- ROCm Communication Collectives Library (RCCL)☆297Updated this week
- pocl - Portable Computing Language☆954Updated this week
- a software library containing BLAS functions written in OpenCL☆851Updated 6 months ago
- MIOpenGEMM is now deprecated☆62Updated last year
- GPUOCelot: A dynamic compilation framework for PTX☆285Updated last year
- Compute Library for Deep Neural Networks (clDNN)☆574Updated 2 years ago
- Intel® Extension for TensorFlow*☆329Updated last month
- common in-memory tensor structure☆942Updated 2 weeks ago
- ☆630Updated this week
- AMDGPU Driver with KFD used by the ROCm project. Also contains the current Linux Kernel that matches this base driver☆348Updated this week
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group☆442Updated 3 months ago
- A performant and modular runtime for TensorFlow☆759Updated 2 weeks ago