ROCm / MADLinks
MAD (Model Automation and Dashboarding)
☆31Updated this week
Alternatives and similar repositories for MAD
Users that are interested in MAD are comparing it to the libraries listed below
Sorting:
- ☆60Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆86Updated 2 weeks ago
- Ongoing research training transformer models at scale☆35Updated this week
- Bandwidth test for ROCm☆75Updated last week
- ☆23Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆144Updated this week
- A TUI-based utility for real-time monitoring of InfiniBand traffic and performance metrics on the local node☆63Updated last month
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆64Updated 7 months ago
- Reference implementations of MLPerf™ HPC training benchmarks☆49Updated 11 months ago
- Multi-GPU communication profiler and visualizer☆37Updated last year
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆156Updated this week
- DGXC Benchmarking provides recipes in ready-to-use templates for evaluating performance of specific AI use cases across hardware and soft…☆60Updated this week
- A hierarchical collective communications library with portable optimizations☆37Updated last year
- oneCCL Bindings for Pytorch* (deprecated)☆104Updated last month
- ☆74Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆113Updated this week
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs☆66Updated last week
- This repository contains the results and code for the MLPerf™ Training v2.0 benchmark.☆29Updated last year
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆410Updated this week
- AMD HPC Research Fund Cloud☆17Updated 2 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆114Updated this week
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆57Updated this week
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.☆203Updated last week
- ☆24Updated 3 months ago
- MLPerf™ logging library☆38Updated last month
- oneAPI Level Zero Conformance & Performance test content☆60Updated this week
- ☆47Updated last year
- Test suite for probing the numerical behavior of NVIDIA tensor cores☆41Updated last year
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high…☆95Updated last week
- Development repository for the Triton language and compiler☆140Updated last week