ROCm / jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
☆24 Updated last week
Alternatives and similar repositories for jax
Users who are interested in jax are comparing it to the libraries listed below.
- A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch ☆24 Updated last week
- 8-bit CUDA functions for PyTorch ☆69 Updated 2 months ago
- RCCL Performance Benchmark Tests ☆81 Updated last week
- Development repository for the Triton language and compiler ☆137 Updated this week
- ROCm's Thunk Interface ☆92 Updated 9 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆269 Updated last week
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high… ☆90 Updated this week
- ☆52 Updated 3 weeks ago
- A collection of examples for the ROCm software stack ☆263 Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆391 Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆114 Updated this week
- Hackable and optimized Transformers building blocks, supporting a composable construction. ☆34 Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆111 Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆147 Updated this week
- CMake modules used within the ROCm libraries ☆70 Updated this week
- OpenAI Triton backend for Intel® GPUs ☆222 Updated this week
- The AMD rocAL is designed to efficiently decode and process images and videos from a variety of storage formats and modify them through a… ☆22 Updated this week
- ☆10 Updated last week
- ☆67 Updated last week
- ☆60 Updated 2 years ago
- Bandwidth test for ROCm ☆70 Updated last week
- ☆130 Updated this week
- Fast and memory-efficient exact attention ☆202 Updated this week
- ☆155 Updated last week
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆84 Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆149 Updated last week
- ☆67 Updated this week
- ☆144 Updated 2 weeks ago
- oneAPI Level Zero Conformance & Performance test content ☆58 Updated last week
- AMD SMI ☆100 Updated last week