spectral-compute / scale-examplesLinks
☆58Updated 11 months ago
Alternatives and similar repositories for scale-examples
Users that are interested in scale-examples are comparing it to the libraries listed below
Sorting:
- ☆57Updated this week
- ☆54Updated last year
- Source code for Intel's Polite Guard NLP project☆35Updated last month
- Tenstorrent console based hardware information program☆43Updated this week
- A fork of OpenBLAS with Armv8-A SVE (Scalable Vector Extension) support☆17Updated 5 years ago
- Bandwidth test for ROCm☆58Updated last month
- Schola is a plugin for enabling Reinforcement Learning (RL) in Unreal Engine. It provides tools to help developers create environments, d…☆44Updated last month
- Tenstorrent system interface library☆24Updated this week
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆49Updated 4 months ago
- Online compiler for HIP and NVIDIA® CUDA® code to WebGPU☆184Updated 5 months ago
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆83Updated last week
- A JPEG Image Compression Service using Part Homomorphic Encryption.☆31Updated 3 months ago
- Rust crates for XetHub☆43Updated 8 months ago
- Lightweight Llama 3 8B Inference Engine in CUDA C☆47Updated 3 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆106Updated this week
- A library for constructing allocators and memory pools. It also contains broadly useful abstractions and utilities for memory management.…☆62Updated this week
- The AMD rocAL is designed to efficiently decode and process images and videos from a variety of storage formats and modify them through a…☆19Updated this week
- asynchronous/distributed speculative evaluation for llama3☆39Updated 10 months ago
- ☆113Updated 2 weeks ago
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆41Updated last week
- ☆196Updated last month
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆55Updated 3 months ago
- AI/GPU flame graph☆162Updated 3 weeks ago
- Columnar database on SSD NVMe☆14Updated 4 years ago
- LLM training in simple, raw C/HIP for AMD GPUs☆49Updated 9 months ago
- Tensor Tiling Library☆36Updated 2 months ago
- AI Tensor Engine for ROCm☆208Updated this week
- A fork of llama3.c used to do some R&D on inferencing☆22Updated 6 months ago
- GCC plugin for C language that tracks references to allocated objects☆27Updated last month
- Onboarding documentation source for the AMD Ryzen™ AI Software Platform. The AMD Ryzen™ AI Software Platform enables developers to take…☆66Updated this week