ROCm / gpuaidevLinks
Repository to host ROCm Developer Hub Notebook Tutorials
☆11Updated 2 weeks ago
Alternatives and similar repositories for gpuaidev
Users that are interested in gpuaidev are comparing it to the libraries listed below
Sorting:
- ☆24Updated 3 weeks ago
- ☆36Updated this week
- AI Tensor Engine for ROCm☆201Updated this week
- ☆61Updated 5 months ago
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆95Updated 2 weeks ago
- OpenAI Triton backend for Intel® GPUs☆187Updated this week
- Experimental projects related to TensorRT☆105Updated this week
- Development repository for the Triton language and compiler☆122Updated this week
- CUTLASS and CuTe Examples☆52Updated 5 months ago
- rocWMMA☆114Updated this week
- The goal of the OSSCI Fleet is to provide a central mechanism to enable test automation, batch job scheduling, and developer access to a …☆12Updated 3 weeks ago
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆401Updated this week
- CUDA Matrix Multiplication Optimization☆188Updated 10 months ago
- Ahead of Time (AOT) Triton Math Library☆64Updated last week
- ☆109Updated 3 weeks ago
- ☆208Updated 10 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆79Updated this week
- A collection of examples for the ROCm software stack☆215Updated last week
- ☆25Updated this week
- amdgpu example code in hip/asm☆32Updated 2 weeks ago
- An extension library of WMMA API (Tensor Core API)☆97Updated 10 months ago
- collection of benchmarks to measure basic GPU capabilities☆376Updated 3 months ago
- Fastest kernels written from scratch☆269Updated 2 months ago
- Evaluating Large Language Models for CUDA Code Generation ComputeEval is a framework designed to generate and evaluate CUDA code from Lar…☆45Updated last month
- Shared Middle-Layer for Triton Compilation☆251Updated this week
- ☆46Updated this week
- ☆215Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆75Updated this week
- ☆146Updated this week
- hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditiona…☆97Updated this week