NVIDIA / grace-kernel
Upstream Kernel with Grace upstream pending patches for partners. Patches include any bug fixes during Grace production while they await upstreaming.
☆15Updated last year
Alternatives and similar repositories for grace-kernel:
Users that are interested in grace-kernel are comparing it to the libraries listed below
- NVIDIA's launch, startup, and logging scripts used by our MLPerf Training and HPC submissions☆24Updated last week
- Bandwidth test for ROCm☆52Updated this week
- ROC profiler library. Profiling with perf-counters and derived metrics.☆133Updated this week
- A system validation and diagnostics tool for monitoring, stress testing, detecting, and troubleshooting issues impacting AMD GPUs in high…☆66Updated this week
- This is ROCgdb, the ROCm source-level debugger for Linux, based on GDB, the GNU source-level debugger.☆53Updated this week
- Benchmarks to capture important workloads.☆29Updated this week
- Magnum IO community repo☆81Updated this week
- CUDA 12.2 HMM demos☆19Updated 5 months ago
- MPI accelerator-integrated communication extensions☆32Updated last year
- ROCm BLAS marshalling library☆125Updated this week
- oneAPI Level Zero Conformance & Performance test content☆48Updated this week
- ☆99Updated 2 months ago
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆78Updated this week
- AMD SMI☆46Updated this week
- CMake modules used within the ROCm libraries☆63Updated this week
- Linux Cross-Memory Attach☆89Updated 4 months ago
- RAND library for HIP programming language☆114Updated this week
- AMD’s C++ library for accelerating tensor primitives☆38Updated this week
- RCCL Performance Benchmark Tests☆55Updated this week
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas…☆210Updated this week
- AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/Ope…☆57Updated this week
- SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi☆38Updated 2 years ago
- GPUDirect Async support for IB Verbs☆92Updated 2 years ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆104Updated this week
- Vendor-neutral library for exposing power and performance features across diverse architectures☆70Updated 3 months ago
- Advanced Profiling and Analytics for AMD Hardware☆139Updated this week
- The LLVM DOE Fork is a fork of upstream LLVM (https://github.com/llvm/llvm-project/) that hosts multiple DOE-funded projects. Contact in…☆25Updated this week
- The compiler support repository provides various Lightning Compiler related services.☆45Updated 8 months ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆48Updated this week
- NVIDIA GPUDirect Storage Driver☆215Updated last month