ROCm / TheRock
The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm
☆126 Updated this week
Alternatives and similar repositories for TheRock
Users who are interested in TheRock are comparing it to the repositories listed below.
- Development repository for the Triton language and compiler ☆122 Updated this week
- AI Tensor Engine for ROCm ☆201 Updated this week
- rocWMMA ☆114 Updated last week
- hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditiona… ☆97 Updated this week
- ROCm BLAS marshalling library ☆142 Updated this week
- A simple Flash Attention v2 implementation with ROCm (RDNA3 GPU, rocWMMA), mainly used for Stable Diffusion (ComfyUI) in Windows ZLUDA en… ☆42 Updated 9 months ago
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs. ☆277 Updated this week
- ☆326 Updated 2 months ago
- ☆146 Updated this week
- Bandwidth test for ROCm ☆56 Updated 2 weeks ago
- A collection of examples for the ROCm software stack ☆215 Updated last week
- Unified compiler/runtime for interfacing with PyTorch Dynamo. ☆100 Updated 2 weeks ago
- AMD's graph optimization engine. ☆220 Updated this week
- ☆60 Updated last year
- ☆136 Updated this week
- GPUOcelot: A dynamic compilation framework for PTX ☆192 Updated 3 months ago
- ☆24 Updated last month
- OpenAI Triton backend for Intel® GPUs ☆187 Updated this week
- No-code CLI designed for accelerating ONNX workflows ☆192 Updated 2 weeks ago
- CMake modules used within the ROCm libraries ☆67 Updated 2 weeks ago
- HIPIFY: Convert CUDA to Portable C++ Code ☆585 Updated this week
- Stretching GPU performance for GEMMs and tensor contractions. ☆242 Updated last week
- A framework that supports executing unmodified CUDA source code on non-NVIDIA devices. ☆127 Updated 5 months ago
- The Riallto Open Source Project from AMD ☆79 Updated last month
- IREE's PyTorch Frontend, based on Torch Dynamo. ☆85 Updated this week
- IREE plugin repository for the AMD AIE accelerator ☆97 Updated this week
- Super fast FP32 matrix multiplication on RDNA3 ☆61 Updated 2 months ago
- ☆46 Updated this week
- Fork of LLVM to support AMD AIEngine processors ☆143 Updated this week
- ROCm Parallel Primitives ☆172 Updated this week