ROCm / pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
☆221Updated this week
Alternatives and similar repositories for pytorch:
Users that are interested in pytorch are comparing it to the libraries listed below
- AMD's graph optimization engine.☆208Updated this week
- Next generation BLAS implementation for ROCm platform☆360Updated this week
- TensorFlow ROCm port☆690Updated this week
- Dockerfiles for the various software layers defined in the ROCm software platform☆451Updated this week
- ROCm Communication Collectives Library (RCCL)☆297Updated this week
- ☆105Updated 3 months ago
- Stretching GPU performance for GEMMs and tensor contractions.☆233Updated this week
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆382Updated last month
- A collection of examples for the ROCm software stack☆185Updated this week
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated 9 months ago
- HIPIFY: Convert CUDA to Portable C++ Code☆552Updated this week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆349Updated this week
- Examples for HIP☆202Updated 2 months ago
- oneCCL Bindings for Pytorch*☆88Updated last month
- Large Model Support in Tensorflow☆202Updated 4 years ago
- Computation using data flow graphs for scalable machine learning☆67Updated this week
- AMD's Machine Intelligence Library☆1,116Updated this week
- Legacy ROCm Software Platform Documentation☆113Updated last year
- Benchmark Suite for Deep Learning☆257Updated this week
- DLPrimitives/OpenCL out of tree backend for pytorch☆317Updated 5 months ago
- oneAPI Collective Communications Library (oneCCL)☆222Updated 3 weeks ago
- TensorFlow-nGraph bridge☆136Updated 3 years ago
- ROCm BLAS marshalling library☆131Updated this week
- ROCm Device Libraries☆97Updated 9 months ago
- MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into …☆190Updated this week
- Python bindings for NVTX☆66Updated last year
- Development repository for the Triton language and compiler☆107Updated this week
- Intel® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN.☆167Updated this week
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime☆235Updated this week
- Deep Learning Primitives and Mini-Framework for OpenCL☆187Updated 5 months ago