ROCm / pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
☆219 Updated this week
Related projects
Alternatives and complementary repositories for pytorch
- Dockerfiles for the various software layers defined in the ROCm software platform ☆432 Updated 2 months ago
- Next generation BLAS implementation for ROCm platform ☆346 Updated this week
- AMD's graph optimization engine. ☆186 Updated this week
- TensorFlow ROCm port ☆688 Updated this week
- Stretching GPU performance for GEMMs and tensor contractions. ☆223 Updated this week
- ROCm Communication Collectives Library (RCCL) ☆268 Updated this week
- Legacy ROCm Software Platform Documentation ☆113 Updated last year
- (Deprecated) hipCaffe: the HIP port of Caffe ☆124 Updated 6 months ago
- ROCm Device Libraries ☆98 Updated 6 months ago
- ROCm BLAS marshalling library ☆118 Updated this week
- BLAS-like Library Instantiation Software Framework ☆129 Updated this week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators ☆313 Updated this week
- A collection of examples for the ROCm software stack ☆167 Updated this week
- ROCm Platform Runtime: ROCr, an HPC market enhanced HSA-based runtime ☆224 Updated this week
- ROCm's Thunk Interface ☆83 Updated 2 weeks ago
- RAND library for HIP programming language ☆111 Updated this week
- MIOpenGEMM is now deprecated ☆61 Updated last year
- portDNN is a library implementing neural network algorithms written using SYCL ☆108 Updated 6 months ago
- Intel® Optimization for Chainer*, a Chainer module providing a NumPy-like API and DNN acceleration using MKL-DNN. ☆163 Updated last week
- The NNEF Tools repository contains tools to generate and consume NNEF documents ☆222 Updated this week
- oneAPI Collective Communications Library (oneCCL) ☆206 Updated this week
- ☆101 Updated this week
- AMD's Machine Intelligence Library ☆1,081 Updated this week
- ROCm Parallel Primitives ☆162 Updated this week
- CMake modules used within the ROCm libraries ☆58 Updated this week
- ☆236 Updated 3 years ago
- ROC profiler library. Profiling with perf-counters and derived metrics. ☆130 Updated this week
- ☆231 Updated this week
- A thin wrapper around MIOpen and cuDNN ☆38 Updated last year
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP) ☆363 Updated 3 months ago