amd / RyzenAI-SW
☆508Updated 3 weeks ago
Alternatives and similar repositories for RyzenAI-SW:
Users that are interested in RyzenAI-SW are comparing it to the libraries listed below
- ☆407Updated last week
- ☆296Updated 3 weeks ago
- AI Tensor Engine for ROCm☆180Updated this week
- A collection of examples for the ROCm software stack☆205Updated this week
- Intel® NPU Acceleration Library☆667Updated 3 months ago
- Onboarding documentation source for the AMD Ryzen™ AI Software Platform. The AMD Ryzen™ AI Software Platform enables developers to take…☆57Updated this week
- Dockerfiles for the various software layers defined in the ROCm software platform☆459Updated this week
- build scripts for ROCm☆189Updated last year
- Intel® NPU (Neural Processing Unit) Driver☆244Updated 3 weeks ago
- Fork of LLVM to support AMD AIEngine processors☆134Updated this week
- Development repository for the Triton language and compiler☆118Updated this week
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.☆266Updated this week
- 8-bit CUDA functions for PyTorch☆48Updated 2 months ago
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆383Updated this week
- HIPIFY: Convert CUDA to Portable C++ Code☆571Updated last week
- hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditiona…☆91Updated this week
- Next generation BLAS implementation for ROCm platform☆366Updated this week
- Local LLM Server with NPU Acceleration☆156Updated last week
- AMD (Radeon GPU) ROCm based setup for popular AI tools on Ubuntu 24.04.1☆203Updated 2 months ago
- AMD related optimizations for transformer models☆75Updated 5 months ago
- ☆128Updated this week
- OpenAI Triton backend for Intel® GPUs☆182Updated this week
- Fast and memory-efficient exact attention☆171Updated this week
- ROCm SMI LIB☆133Updated last week
- DLPrimitives/OpenCL out of tree backend for pytorch☆341Updated 7 months ago
- ☆60Updated last year
- AMD's graph optimization engine.☆215Updated this week
- ☆106Updated 2 weeks ago
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆11Updated 10 months ago
- See how to play with ROCm, run it with AMD GPUs!☆25Updated this week