ROCm / jaxLinks
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
☆24Updated this week
Alternatives and similar repositories for jax
Users that are interested in jax are comparing it to the libraries listed below
Sorting:
- 8-bit CUDA functions for PyTorch☆63Updated 2 weeks ago
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high…☆81Updated this week
- The AMD rocAL is designed to efficiently decode and process images and videos from a variety of storage formats and modify them through a…☆21Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆111Updated this week
- CMake modules used within the ROCm libraries☆67Updated this week
- ROCm's Thunk Interface☆91Updated 6 months ago
- rocWMMA☆133Updated this week
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆23Updated this week
- Bandwidth test for ROCm☆66Updated this week
- ☆151Updated last week
- ☆60Updated 2 years ago
- ☆66Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆261Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆383Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆149Updated this week
- Development repository for the Triton language and compiler☆133Updated this week
- ☆133Updated last week
- RCCL Performance Benchmark Tests☆77Updated this week
- A collection of examples for the ROCm software stack☆246Updated this week
- ROCm Device Libraries☆96Updated last year
- ☆51Updated 4 months ago
- Hackable and optimized Transformers building blocks, supporting a composable construction.☆32Updated last week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆142Updated last week
- Deep Learning Primitives and Mini-Framework for OpenCL☆202Updated last year
- oneAPI Level Zero Conformance & Performance test content☆57Updated last week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆84Updated last week
- ROCm Documentation Python package for ReadTheDocs build standardization☆16Updated this week
- Fast and memory-efficient exact attention☆191Updated this week
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆47Updated this week
- ☆38Updated this week