ROCm / hipify_torch
☆18Updated 4 months ago
Alternatives and similar repositories for hipify_torch:
Users that are interested in hipify_torch are comparing it to the libraries listed below
- rocWMMA☆100Updated this week
- OpenCL/SPIR-V implementation of HIP☆104Updated 2 years ago
- AMD’s C++ library for accelerating tensor primitives☆38Updated this week
- Bandwidth test for ROCm☆54Updated this week
- CMake modules used within the ROCm libraries☆64Updated this week
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆79Updated this week
- Random number library that generate pseudo-random and quasi-random numbers.☆26Updated this week
- ☆17Updated last week
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆50Updated last year
- HIPCC: HIP compiler driver☆40Updated 9 months ago
- ☆49Updated this week
- GPGMM, a General-Purpose GPU Memory Management Library.☆33Updated 3 weeks ago
- Reusable software components for ROCm developers☆81Updated this week
- Tensor Tiling Library☆34Updated 5 months ago
- hipFFT is a FFT marshalling library.☆58Updated this week
- An implementation of HIP that works on CPUs, across OSes.☆115Updated 11 months ago
- Utilities for accessing AMD's Machine-Readable GPU ISA Specifications.☆27Updated 5 months ago
- SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi☆38Updated 2 weeks ago
- ROCm Device Libraries☆97Updated 9 months ago
- AMD's graph optimization engine.☆208Updated this week
- SYCL implementation of Fused MLPs for Intel GPUs☆46Updated 3 months ago
- AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/Ope…☆58Updated last week
- IREE C++ Template☆17Updated 6 months ago
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆114Updated last month
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆39Updated this week
- Marek's approach to building AMD GPU drivers for driver development☆22Updated last month
- ROCm SPARSE marshalling library☆67Updated this week
- ☆20Updated 3 years ago
- ☆105Updated 3 months ago
- Fast integer division with divisor not known at compile time. To be used primarily in CUDA kernels.☆71Updated 9 years ago