ROCm / MADLinks
MAD (Model Automation and Dashboarding)
☆31Updated 2 weeks ago
Alternatives and similar repositories for MAD
Users that are interested in MAD are comparing it to the libraries listed below
Sorting:
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆86Updated last week
- ☆59Updated this week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆63Updated 7 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆144Updated last week
- ☆22Updated 3 months ago
- Ongoing research training transformer models at scale☆35Updated last week
- ☆74Updated this week
- oneCCL Bindings for Pytorch* (deprecated)☆104Updated last month
- Bandwidth test for ROCm☆73Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆113Updated last week
- AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming☆164Updated last week
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆155Updated last week
- AMD HPC Research Fund Cloud☆17Updated last week
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs☆64Updated this week
- oneAPI Collective Communications Library (oneCCL)☆253Updated last month
- AI Tensor Engine for ROCm☆344Updated last week
- A hierarchical collective communications library with portable optimizations☆37Updated last year
- Ahead of Time (AOT) Triton Math Library☆88Updated this week
- A TUI-based utility for real-time monitoring of InfiniBand traffic and performance metrics on the local node☆63Updated last month
- Microsoft Collective Communication Library☆66Updated last year
- COCCL: Compression and precision co-aware collective communication library☆29Updated 10 months ago
- Test suite for probing the numerical behavior of NVIDIA tensor cores☆41Updated last year
- Reference implementations of MLPerf™ HPC training benchmarks☆49Updated 11 months ago
- Development repository for the Triton language and compiler☆140Updated this week
- OpenAI Triton backend for Intel® GPUs☆225Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆410Updated this week
- Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs☆13Updated 9 months ago
- Multi-GPU communication profiler and visualizer☆37Updated last year
- ☆24Updated 3 months ago
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆56Updated this week