ROCm / pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
☆219Updated this week
Related projects ⓘ
Alternatives and complementary repositories for pytorch
- AMD's graph optimization engine.☆185Updated this week
- Next generation BLAS implementation for ROCm platform☆346Updated this week
- Dockerfiles for the various software layers defined in the ROCm software platform☆431Updated 2 months ago
- Stretching GPU performance for GEMMs and tensor contractions.☆220Updated this week
- TensorFlow ROCm port☆687Updated this week
- ROCm Communication Collectives Library (RCCL)☆265Updated this week
- A collection of examples for the ROCm software stack☆166Updated this week
- Large Model Support in Tensorflow☆201Updated 4 years ago
- ROCm Device Libraries☆98Updated 6 months ago
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆309Updated this week
- AMD's Machine Intelligence Library☆1,075Updated this week
- Python bindings for NVTX☆66Updated last year
- Legacy ROCm Software Platform Documentation☆112Updated last year
- Issues related to MLPerf™ training policies, including rules and suggested changes☆92Updated last month
- ROCm BLAS marshalling library☆118Updated this week
- (Deprecated) hipCaffe: the HIP port of Caffe☆124Updated 6 months ago
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆363Updated 2 months ago
- Development repository for the Triton language and compiler☆92Updated this week
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime☆223Updated this week
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform☆433Updated 4 years ago
- ☆236Updated 3 years ago
- Deep Learning Primitives and Mini-Framework for OpenCL☆172Updated 2 months ago
- HIPIFY: Convert CUDA to Portable C++ Code☆523Updated this week
- ROCm Parallel Primitives☆161Updated this week
- Nod.ai 🦈 version of 👻 . You probably want to start at https://github.com/nod-ai/shark for the product and the upstream IREE repository …☆107Updated this week
- Examples for HIP☆201Updated this week
- ☆83Updated 5 months ago
- The Foundation for All Legate Libraries☆189Updated last month
- CUDA Kernel Benchmarking Library☆512Updated 2 weeks ago
- Next generation FFT implementation for ROCm☆174Updated this week