☆49Mar 10, 2026Updated 2 weeks ago
Alternatives and similar repositories for rocprof-compute-viewer
Users that are interested in rocprof-compute-viewer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆113Mar 18, 2026Updated last week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆26Feb 26, 2026Updated 3 weeks ago
- QuickReduce is a performant all-reduce library designed for AMD ROCm that supports inline compression.☆38Aug 29, 2025Updated 6 months ago
- A series of high-performance GEMM (General Matrix Multiply) implementations Iteratively optimised for H100 GPUs in Pure CUDA.☆73Feb 18, 2026Updated last month
- Header-only library of GPU-accelerated, concurrent data structures.☆12Nov 11, 2025Updated 4 months ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆153Jan 21, 2026Updated 2 months ago
- ☆10Nov 16, 2024Updated last year
- A lightweight triton-based General Matrix Multiplication (GEMM) library.☆55Updated this week
- A lightweight, general-purpose framework for evaluating GPU kernel correctness and performance.☆45Mar 17, 2026Updated last week
- HIP backend patch for Numba, the NumPy aware dynamic Python compiler using LLVM.☆19Feb 16, 2026Updated last month
- AI Tensor Engine for ROCm☆385Updated this week
- Intel® SHMEM - Device initiated shared memory based communication library☆32Nov 12, 2025Updated 4 months ago
- Super fast FP32 matrix multiplication on RDNA3☆87Mar 30, 2025Updated 11 months ago
- ☆12May 30, 2025Updated 9 months ago
- NUMA-aware multi-CPU multi-GPU data transfer benchmarks☆28Oct 26, 2023Updated 2 years ago
- AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming☆182Updated this week
- The C++ Standard Library for your entire system.☆27Updated this week
- The goal of the OSSCI Fleet is to provide a central mechanism to enable test automation, batch job scheduling, and developer access to a …☆13Feb 27, 2026Updated 3 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆145Updated this week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆12Jun 24, 2024Updated last year
- Development repository for the Triton language and compiler☆143Updated this week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆165Mar 18, 2026Updated last week
- Personal blog of ENvironmentSet based on oversomething.☆14Oct 29, 2024Updated last year
- ☆10Nov 17, 2022Updated 3 years ago
- It is an LLM-based AI agent, which can write correct and efficient gpu kernels automatically.☆78Mar 18, 2026Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆139Mar 13, 2026Updated last week
- ☆16Nov 10, 2025Updated 4 months ago
- This program is a utility that reads the internal information of Windows Subsystem for Linux from the system and outputs the data to a st…☆12Dec 8, 2022Updated 3 years ago
- Use dhall as an external datasource☆11Apr 16, 2025Updated 11 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆56Jan 22, 2026Updated 2 months ago
- LLDB script for dumping C++ structs/classes and variables layout in memory☆13Jun 15, 2021Updated 4 years ago
- Efficent c++ structured logging library.☆13Jun 26, 2017Updated 8 years ago
- A concise C++ demonstration of image resource interoperability between D3D11 and Vulkan.☆14Jul 14, 2024Updated last year
- Automated Unity3d project validation☆11Apr 19, 2025Updated 11 months ago
- ☆47Nov 3, 2025Updated 4 months ago
- amdgpu example code in hip/asm☆56Mar 18, 2026Updated last week
- A command-line tool that converts SAMI (.smi) to SSA/ASS (.ass)☆14May 25, 2021Updated 4 years ago
- A unix pipeline utils based on LLM☆16May 15, 2023Updated 2 years ago
- This repository is no longer maintained.☆15Mar 10, 2022Updated 4 years ago