ROCm / pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
☆223 Updated this week
Alternatives and similar repositories for pytorch:
Users who are interested in pytorch are comparing it to the libraries listed below.
- Dockerfiles for the various software layers defined in the ROCm software platform ☆459 Updated last month
- Next generation BLAS implementation for ROCm platform ☆362 Updated this week
- AMD's graph optimization engine. ☆214 Updated this week
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP) ☆392 Updated 3 months ago
- Stretching GPU performance for GEMMs and tensor contractions. ☆235 Updated this week
- TensorFlow ROCm port ☆690 Updated this week
- Legacy ROCm Software Platform Documentation ☆113 Updated last year
- (Deprecated) hipCaffe: the HIP port of Caffe ☆124 Updated 11 months ago
- ROCm BLAS marshalling library ☆136 Updated this week
- ☆105 Updated this week
- oneCCL Bindings for Pytorch* ☆93 Updated this week
- ROCm Communication Collectives Library (RCCL) ☆317 Updated this week
- ROCm Device Libraries ☆97 Updated 11 months ago
- BLAS-like Library Instantiation Software Framework ☆137 Updated 2 weeks ago
- A collection of examples for the ROCm software stack ☆200 Updated this week
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform ☆437 Updated 4 years ago
- portDNN is a library implementing neural network algorithms written using SYCL ☆113 Updated 10 months ago
- common in-memory tensor structure ☆974 Updated this week
- HIPIFY: Convert CUDA to Portable C++ Code ☆569 Updated this week
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime ☆242 Updated this week
- ROCm Parallel Primitives ☆171 Updated this week
- ROC profiler library. Profiling with perf-counters and derived metrics. ☆141 Updated this week
- MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into … ☆192 Updated this week
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators ☆376 Updated this week
- ☆250 Updated this week
- OpenCL port of TensorFlow using SYCL, generic instructions for building are here: ☆61 Updated 5 years ago
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository … ☆106 Updated 3 months ago
- oneAPI Collective Communications Library (oneCCL) ☆232 Updated last week
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆261 Updated 3 months ago
- Next generation FFT implementation for ROCm ☆190 Updated this week