ROCm / pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
☆222Updated this week
Alternatives and similar repositories for pytorch:
Users that are interested in pytorch are comparing it to the libraries listed below
- Dockerfiles for the various software layers defined in the ROCm software platform☆453Updated last month
- Next generation BLAS implementation for ROCm platform☆362Updated this week
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated 10 months ago
- AMD's graph optimization engine.☆213Updated this week
- TensorFlow ROCm port☆689Updated this week
- Legacy ROCm Software Platform Documentation☆113Updated last year
- ROCm Device Libraries☆97Updated 10 months ago
- AMD's Machine Intelligence Library☆1,130Updated this week
- Stretching GPU performance for GEMMs and tensor contractions.☆233Updated this week
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime☆237Updated this week
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform☆437Updated 4 years ago
- ☆106Updated 2 weeks ago
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆365Updated this week
- ROCm Communication Collectives Library (RCCL)☆305Updated this week
- TensorFlow-nGraph bridge☆136Updated 4 years ago
- Large Model Support in Tensorflow☆202Updated 4 years ago
- ROCm's Thunk Interface☆87Updated last week
- A collection of examples for the ROCm software stack☆191Updated last week
- MIOpenGEMM is now deprecated☆62Updated last year
- Examples for HIP☆203Updated 3 months ago
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆389Updated 2 months ago
- oneCCL Bindings for Pytorch*☆90Updated last week
- BLAS-like Library Instantiation Software Framework☆134Updated this week
- Computation using data flow graphs for scalable machine learning☆67Updated this week
- common in-memory tensor structure☆963Updated last week
- Guide for building custom op for TensorFlow☆378Updated last year
- ☆408Updated this week
- Intel® Extension for TensorFlow*☆334Updated this week
- CUDA Kernel Benchmarking Library☆593Updated last week
- Deep Learning Benchmark for comparing the performance of DL frameworks, GPUs, and single vs half precision☆429Updated 4 years ago