amd / fuzzyHSA
☆54Updated last year
Alternatives and similar repositories for fuzzyHSA
Users that are interested in fuzzyHSA are comparing it to the libraries listed below
- ☆62Updated last year
- Super fast FP32 matrix multiplication on RDNA3☆71Updated 5 months ago
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.☆296Updated last week
- RDNA3 emulator☆54Updated 4 months ago
- ☆450Updated 4 months ago
- High-Performance SGEMM on CUDA devices☆97Updated 7 months ago
- Onboarding documentation source for the AMD Ryzen™ AI Software Platform. The AMD Ryzen™ AI Software Platform enables developers to take…☆77Updated 2 weeks ago
- Make PyTorch models at least run on APUs.☆56Updated last year
- LLM training in simple, raw C/HIP for AMD GPUs☆51Updated 11 months ago
- The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm☆331Updated this week
- ☆21Updated last week
- ☆134Updated last month
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆111Updated this week
- ☆149Updated last week
- Custom PTX Instruction Benchmark☆126Updated 6 months ago
- Repository of model demos using TT-Buda☆62Updated 4 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆42Updated 2 weeks ago
- Nvidia Instruction Set Specification Generator☆290Updated last year
- Deep Learning Primitives and Mini-Framework for OpenCL☆201Updated 11 months ago
- asynchronous/distributed speculative evaluation for llama3☆39Updated last year
- A small OpenCL benchmark program to measure peak GPU/CPU performance.☆239Updated last month
- User-Mode Driver for Tenstorrent hardware☆31Updated last week
- HIPIFY: Convert CUDA to Portable C++ Code☆613Updated last week
- ☆461Updated this week
- OpenCL/SPIR-V implementation of HIP☆105Updated 2 years ago
- A collection of examples for the ROCm software stack☆236Updated this week
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆52Updated 6 months ago
- ☆58Updated this week
- AI Tensor Engine for ROCm☆260Updated this week
- Gpu benchmark☆66Updated 7 months ago