spectral-compute / scale-examplesLinks
☆66Updated last year
Alternatives and similar repositories for scale-examples
Users that are interested in scale-examples are comparing it to the libraries listed below
Sorting:
- Online compiler for HIP and NVIDIA® CUDA® code to WebGPU☆205Updated last year
- ☆63Updated last week
- ☆53Updated last year
- ☆183Updated 2 weeks ago
- Source code for Intel's Polite Guard NLP project☆37Updated last month
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆51Updated 11 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆113Updated this week
- Code sample showing how to run and benchmark models on Qualcomm's Window PCs☆104Updated last year
- A minimalistic C++ Jinja templating engine for LLM chat templates☆203Updated 4 months ago
- Onboarding documentation source for the AMD Ryzen™ AI Software Platform. The AMD Ryzen™ AI Software Platform enables developers to take…☆92Updated this week
- Tensor library & inference framework for machine learning☆117Updated 4 months ago
- ☆281Updated this week
- chipStar is a tool for compiling and running HIP/CUDA on SPIR-V via OpenCL or Level Zero APIs.☆315Updated last week
- The AMD rocAL is designed to efficiently decode and process images and videos from a variety of storage formats and modify them through a…☆23Updated this week
- No-code CLI designed for accelerating ONNX workflows☆227Updated 7 months ago
- ☆191Updated last year
- A fork of OpenBLAS with Armv8-A SVE (Scalable Vector Extension) support☆18Updated 5 years ago
- Arm AArch64 to RISC-V Transpiler☆35Updated 5 years ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆42Updated 2 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆33Updated last week
- ☆200Updated 9 months ago
- AI/GPU flame graph☆242Updated 4 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆148Updated last week
- Repository of model demos using TT-Buda☆63Updated 10 months ago
- Rust crates for XetHub☆78Updated last year
- CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning☆417Updated last month
- Exploring the scalable matrix extension of the Apple M4 processor☆220Updated last year
- Fully Open Language Models with Stellar Performance☆318Updated 2 months ago
- A JPEG Image Compression Service using Part Homomorphic Encryption.☆31Updated 11 months ago
- RDNA3 emulator☆55Updated 9 months ago