A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
☆25 · Feb 26, 2026 · Updated this week
Alternatives and similar repositories for apex
Users who are interested in apex are comparing it to the libraries listed below.
- High-Performance Linpack Benchmark adopted version for GPU backend ☆12 · Sep 12, 2022 · Updated 3 years ago
- Hackable and optimized Transformers building blocks, supporting a composable construction. ☆34 · Feb 24, 2026 · Updated last week
- ☆17 · Nov 11, 2025 · Updated 3 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆86 · Feb 11, 2026 · Updated 2 weeks ago
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs) ☆57 · Updated this week
- ☆65 · Updated this week
- Fast and memory-efficient exact attention ☆221 · Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆411 · Feb 23, 2026 · Updated last week
- High Performance Linpack for Next-Generation AMD HPC Accelerators ☆67 · Dec 10, 2025 · Updated 2 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆139 · Updated this week
- CMake modules used within the ROCm libraries ☆73 · Feb 23, 2026 · Updated last week
- Development repository for the Triton language and compiler ☆141 · Updated this week
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high… ☆96 · Updated this week
- This version of Chombo is fortran-free and depends on the Proto middleware infrastructure for performance portability. ☆10 · Sep 12, 2025 · Updated 5 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo ☆165 · Feb 16, 2026 · Updated 2 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo. NOTE: develop branch is maintained as a read-only mirror ☆523 · Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆114 · Updated this week
- The MolE pre-training framework to learn general molecular representations from unlabeled structures ☆12 · May 26, 2025 · Updated 9 months ago
- Community maintained hardware plugin for vLLM on AWS Neuron ☆23 · Updated this week
- Tool for finding matches to degenerate sequence motifs in FASTA files. ☆13 · Mar 11, 2024 · Updated last year
- ☆10 · Updated this week
- Mirror only; see https://gitlab.rtems.org/rtems/docs/rtems-docs/ ☆10 · Feb 21, 2026 · Updated last week
- Utility scripts to configure processors, perform synthesis, load onto FPGAs, and other tasks related to ProcessorCI. ☆17 · Dec 7, 2025 · Updated 2 months ago
- Procyon is the brightest star in the constellation of Canis Minor. But it's also the name of my RISC-V out-of-order processor. ☆12 · Apr 6, 2023 · Updated 2 years ago
- Build NCCL-Tests and configure SSHD in PyTorch container to help you test NCCL faster! ☆13 · Aug 28, 2025 · Updated 6 months ago
- AMD SMI ☆116 · Feb 20, 2026 · Updated last week
- ☆13 · Feb 10, 2026 · Updated 3 weeks ago
- ☆11 · Nov 23, 2025 · Updated 3 months ago
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE. ☆11 · May 6, 2023 · Updated 2 years ago
- An Open-Source Community Supported Fortran layer for AMD HIP ☆10 · May 20, 2020 · Updated 5 years ago
- Watts Up? Pro/.Net meter logger ☆11 · Aug 10, 2021 · Updated 4 years ago
- RISC-V System on Chip Builder ☆12 · Sep 27, 2020 · Updated 5 years ago
- ☆12 · May 8, 2025 · Updated 9 months ago
- The official Snap package of the FreeCAD project ☆13 · Feb 5, 2026 · Updated 3 weeks ago
- Using MolE pre-trained representation to predict novel antimicrobial compounds ☆11 · Aug 28, 2025 · Updated 6 months ago
- ☆12 · Mar 19, 2022 · Updated 3 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆113 · Updated this week
- AMD’s C++ library for accelerating tensor primitives ☆49 · Feb 18, 2026 · Updated last week
- Sphinx themes ("stanford" and "neo-rtd") based on readthedocs.org ☆10 · Jan 9, 2017 · Updated 9 years ago