ROCm / jaxLinks
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
☆24Updated this week
Alternatives and similar repositories for jax
Users that are interested in jax are comparing it to the libraries listed below
Sorting:
- Development repository for the Triton language and compiler☆125Updated this week
- 8-bit CUDA functions for PyTorch☆53Updated 3 weeks ago
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high…☆73Updated this week
- OpenAI Triton backend for Intel® GPUs☆191Updated this week
- AMD SMI☆78Updated this week
- The AMD rocAL is designed to efficiently decode and process images and videos from a variety of storage formats and modify them through a…☆19Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆85Updated this week
- Ongoing research training transformer models at scale☆24Updated last week
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆84Updated last week
- RCCL Performance Benchmark Tests☆70Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆110Updated this week
- ☆63Updated this week
- ☆60Updated last year
- rocWMMA☆119Updated this week
- Bandwidth test for ROCm☆60Updated this week
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime☆260Updated this week
- Fast and memory-efficient exact attention☆177Updated this week
- ROCm BLAS marshalling library☆144Updated this week
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆22Updated this week
- ☆48Updated last month
- ROCm's Thunk Interface☆91Updated 4 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆382Updated this week
- CMake modules used within the ROCm libraries☆68Updated last week
- ☆40Updated this week
- HIPCC: HIP compiler driver☆40Updated last year
- ROC profiler library. Profiling with perf-counters and derived metrics.☆150Updated last week
- Hackable and optimized Transformers building blocks, supporting a composable construction.☆31Updated this week
- Legacy ROCm Software Platform Documentation☆112Updated 2 years ago
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific…☆161Updated this week
- ROCm SMI LIB☆140Updated this week