ROCm / ROCm
AMD ROCm™ Software - GitHub Home
☆5,273Updated this week
Alternatives and similar repositories for ROCm
Users that are interested in ROCm are comparing it to the libraries listed below
Sorting:
- HIP: C++ Heterogeneous-Compute Interface for Portability☆4,006Updated this week
- AMD's Machine Intelligence Library☆1,146Updated this week
- Dockerfiles for the various software layers defined in the ROCm software platform☆465Updated 3 weeks ago
- TensorFlow ROCm port☆691Updated this week
- OpenCL integration for Python, plus shiny features☆1,098Updated last week
- DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning. DirectML provides GPU acceleration for comm…☆2,446Updated last month
- CUDA on non-NVIDIA GPUs☆11,325Updated this week
- AMDGPU Driver with KFD used by the ROCm project. Also contains the current Linux Kernel that matches this base driver☆360Updated last month
- Simple, safe way to store and distribute tensors☆3,268Updated last week
- A machine learning compiler for GPUs, CPUs, and ML accelerators☆3,169Updated this week
- A Python package for extending the official PyTorch that can easily obtain performance on Intel platform☆1,845Updated this week
- HIPIFY: Convert CUDA to Portable C++ Code☆575Updated last week
- oneAPI Deep Neural Network Library (oneDNN)☆3,790Updated this week
- Tuned OpenCL BLAS☆1,103Updated 3 weeks ago
- Optimized primitives for collective multi-GPU communication☆3,710Updated 2 weeks ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆227Updated this week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆16,539Updated this week
- Compiler for Neural Network hardware accelerators☆3,291Updated last year
- Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.☆1,920Updated this week
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform☆438Updated 4 years ago
- OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.☆6,764Updated this week
- DLPrimitives/OpenCL out of tree backend for pytorch☆346Updated 8 months ago
- a language for fast, portable data-parallel computation☆6,058Updated this week
- CUDA Templates for Linear Algebra Subroutines☆7,450Updated 2 weeks ago
- NVIDIA Linux open GPU kernel module source☆15,773Updated this week
- build scripts for ROCm☆186Updated last year
- Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices☆859Updated 3 weeks ago
- Intel® Graphics Compute Runtime for oneAPI Level Zero and OpenCL™ Driver☆1,233Updated this week
- CUDA integration for Python, plus shiny features☆1,936Updated last week
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl☆2,303Updated last year