spectral-compute / scale-examplesLinks
☆65Updated last year
Alternatives and similar repositories for scale-examples
Users that are interested in scale-examples are comparing it to the libraries listed below
Sorting:
- ☆53Updated last year
- Source code for Intel's Polite Guard NLP project☆37Updated 2 weeks ago
- ☆62Updated last month
- Online compiler for HIP and NVIDIA® CUDA® code to WebGPU☆203Updated 11 months ago
- ☆175Updated last month
- Onboarding documentation source for the AMD Ryzen™ AI Software Platform. The AMD Ryzen™ AI Software Platform enables developers to take…☆88Updated 2 weeks ago
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆52Updated 10 months ago
- Code sample showing how to run and benchmark models on Qualcomm's Window PCs☆104Updated last year
- The AMD rocAL is designed to efficiently decode and process images and videos from a variety of storage formats and modify them through a…☆23Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆113Updated this week
- Fully Open Language Models with Stellar Performance☆312Updated last month
- ☆191Updated last year
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆42Updated 2 weeks ago
- CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning☆252Updated 2 weeks ago
- A minimalistic C++ Jinja templating engine for LLM chat templates☆202Updated 3 months ago
- rocDecode is a high performance video decode SDK for AMD hardware☆32Updated 2 weeks ago
- CUDA Tile IR is an MLIR-based intermediate representation and compiler infrastructure for CUDA kernel optimization, focusing on tile-base…☆649Updated last week
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas…☆236Updated this week
- Tenstorrent console based hardware information program☆57Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆148Updated last week
- No-code CLI designed for accelerating ONNX workflows☆222Updated 6 months ago
- AI/GPU flame graph☆233Updated 2 months ago
- A fork of OpenBLAS with Armv8-A SVE (Scalable Vector Extension) support☆17Updated 5 years ago
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.☆307Updated last week
- Tensor library & inference framework for machine learning☆117Updated 2 months ago
- Arm AArch64 to RISC-V Transpiler☆35Updated 5 years ago
- Repository of model demos using TT-Buda☆63Updated 8 months ago
- ☆199Updated 7 months ago
- ☆57Updated 3 months ago
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high…☆92Updated 2 weeks ago