amd / fuzzyHSALinks
☆53Updated last year
Alternatives and similar repositories for fuzzyHSA
Users that are interested in fuzzyHSA are comparing it to the libraries listed below
Sorting:
- ☆65Updated last year
- Super fast FP32 matrix multiplication on RDNA3☆81Updated 9 months ago
- Schola is a plugin for enabling Reinforcement Learning (RL) in Unreal Engine. It provides tools to help developers create environments, d…☆61Updated last week
- Onboarding documentation source for the AMD Ryzen™ AI Software Platform. The AMD Ryzen™ AI Software Platform enables developers to take…☆88Updated 2 weeks ago
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.☆307Updated last week
- RDNA3 emulator☆55Updated 8 months ago
- Fast and Furious AMD Kernels☆324Updated last week
- CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning☆252Updated 2 weeks ago
- High-Performance SGEMM on CUDA devices☆114Updated 11 months ago
- Custom PTX Instruction Benchmark☆137Updated 10 months ago
- ☆22Updated 2 months ago
- ctypes wrappers for HIP, CUDA, and OpenCL☆130Updated last year
- ☆246Updated last month
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆113Updated this week
- Tensor Tiling Library☆38Updated 3 months ago
- asynchronous/distributed speculative evaluation for llama3☆39Updated last year
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high…☆92Updated 2 weeks ago
- A GPU port of DOOM☆178Updated 5 months ago
- Nvidia Instruction Set Specification Generator☆306Updated last year
- OpenCL/SPIR-V implementation of HIP☆105Updated 3 years ago
- A collection of examples for the ROCm software stack☆265Updated last week
- Repository of model demos using TT-Buda☆63Updated 8 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆148Updated last week
- LLM training in simple, raw C/HIP for AMD GPUs☆56Updated last year
- AMD SMI☆103Updated 2 weeks ago
- Tenstorrent console based hardware information program☆57Updated last week
- ☆155Updated last week
- User-Mode Driver for Tenstorrent hardware☆36Updated this week
- Deep Learning Primitives and Mini-Framework for OpenCL☆205Updated last year
- Bandwidth test for ROCm☆72Updated 2 weeks ago