amd / fuzzyHSALinks
☆54Updated last year
Alternatives and similar repositories for fuzzyHSA
Users that are interested in fuzzyHSA are comparing it to the libraries listed below
Sorting:
- Super fast FP32 matrix multiplication on RDNA3☆64Updated 2 months ago
- ☆58Updated 11 months ago
- ☆21Updated last month
- High-Performance SGEMM on CUDA devices☆95Updated 5 months ago
- The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm☆177Updated this week
- LLM training in simple, raw C/HIP for AMD GPUs☆49Updated 9 months ago
- Tensor Tiling Library☆36Updated 2 months ago
- Custom PTX Instruction Benchmark☆126Updated 3 months ago
- ☆46Updated last week
- Onboarding documentation source for the AMD Ryzen™ AI Software Platform. The AMD Ryzen™ AI Software Platform enables developers to take…☆66Updated this week
- Schola is a plugin for enabling Reinforcement Learning (RL) in Unreal Engine. It provides tools to help developers create environments, d…☆44Updated last month
- RDNA3 emulator☆54Updated 2 months ago
- AI Tensor Engine for ROCm☆208Updated this week
- ☆108Updated last week
- ☆113Updated 2 weeks ago
- ☆119Updated 2 months ago
- ☆447Updated 2 months ago
- rocWMMA☆115Updated last week
- Bandwidth test for ROCm☆58Updated last month
- tenstorrent kernel from twitch☆28Updated last year
- Derived from Nemes' gpuperftests☆30Updated 11 months ago
- ☆230Updated last week
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.☆284Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆106Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆26Updated last week
- AMD related optimizations for transformer models☆79Updated 7 months ago
- Make PyTorch models at least run on APUs.☆55Updated last year
- ROCm BLAS marshalling library☆144Updated last week
- User-Mode Driver for Tenstorrent hardware☆24Updated this week
- Experimental GPU language with meta-programming☆23Updated 9 months ago