A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
☆25Mar 18, 2026Updated this week
Alternatives and similar repositories for apex
Users that are interested in apex are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Nov 11, 2025Updated 4 months ago
- ☆64Updated this week
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆60Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆88Mar 5, 2026Updated 2 weeks ago
- Fast and memory-efficient exact attention☆224Updated this week
- Hackable and optimized Transformers building blocks, supporting a composable construction.☆34Feb 26, 2026Updated 3 weeks ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆68Dec 10, 2025Updated 3 months ago
- RJT-RL: De novo molecular design using a Reversible Junction Tree and Reinforcement Learning☆24Aug 22, 2022Updated 3 years ago
- CMake modules used within the ROCm libraries☆73Mar 13, 2026Updated last week
- An Open-Source Community Supported Fortran layer for AMD HIP☆10May 20, 2020Updated 5 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror☆525Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more☆26Updated this week
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆20Oct 23, 2023Updated 2 years ago
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high…☆97Updated this week
- GPUDirect Async implementation of HPGMG-FV CUDA☆11May 11, 2018Updated 7 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆139Mar 13, 2026Updated last week
- Watts Up? Pro/.Net meter logger☆11Aug 10, 2021Updated 4 years ago
- ROCm Install Utilities: rocminstall.py script to install a specific ROCm release version/revision.☆14Jun 20, 2025Updated 9 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆165Updated this week
- Awesome Quantization Paper lists with Codes☆10Feb 24, 2021Updated 5 years ago
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆11May 6, 2023Updated 2 years ago
- Development repository for the Triton language and compiler☆143Updated this week
- Mirror only see https://gitlab.rtems.org/rtems/docs/rtems-docs/☆10Mar 13, 2026Updated last week
- This is the open source version of HPL-MXP. The code performance has been verified on Frontier☆18Jul 9, 2025Updated 8 months ago
- ☆11Dec 16, 2016Updated 9 years ago
- AMD SMI☆119Updated this week
- A human-friendly implementation of the iRobot Open Interface version 2 API.☆14May 14, 2016Updated 9 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆390Updated this week
- ☆13Mar 10, 2026Updated last week
- Just another Julia Debugger☆14May 29, 2019Updated 6 years ago
- ☆16Aug 10, 2022Updated 3 years ago
- A Python script to convert vobsub subtitles into srt format using tesseract for ocr☆10Sep 28, 2014Updated 11 years ago
- AMD’s C++ library for accelerating tensor primitives☆49Updated this week
- OpenCL porting of the GROMACS molecular simulation toolkit☆27Sep 5, 2015Updated 10 years ago
- RISC-V System on Chip Builder☆12Sep 27, 2020Updated 5 years ago
- Utilities for ROCm Tech Support Log Collections☆13Mar 14, 2026Updated last week
- Repo for climate deep learning codes☆16May 21, 2019Updated 6 years ago
- ☆12Mar 19, 2022Updated 4 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆117Updated this week