ROCm / hipify_torch
☆19Updated 5 months ago
Alternatives and similar repositories for hipify_torch:
Users that are interested in hipify_torch are comparing it to the libraries listed below
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆79Updated last week
- CMake modules used within the ROCm libraries☆65Updated this week
- AMD’s C++ library for accelerating tensor primitives☆39Updated this week
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆50Updated last week
- Bandwidth test for ROCm☆54Updated 2 weeks ago
- rocWMMA☆105Updated this week
- Random number library that generate pseudo-random and quasi-random numbers.☆26Updated this week
- OpenCL/SPIR-V implementation of HIP☆104Updated 2 years ago
- hipFFT is a FFT marshalling library.☆60Updated this week
- HIPCC: HIP compiler driver☆41Updated 10 months ago
- ☆34Updated last week
- Reusable software components for ROCm developers☆83Updated this week
- ☆17Updated 2 weeks ago
- ROCm Systems Profiler☆16Updated this week
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆40Updated last week
- SYCL Conformance Tests☆68Updated last week
- SYCL Reference Manual☆27Updated 11 months ago
- ☆54Updated 9 months ago
- The HIP Environment and ROCm Kit - A lightweight open source build system for HIP and ROCm☆37Updated this week
- ROCm Device Libraries☆97Updated 10 months ago
- AMD ROCm Performance Primitives (RPP) library is a comprehensive high-performance computer vision library for AMD processors with HIP/Ope…☆59Updated this week
- ☆25Updated this week
- ROCm BLAS marshalling library☆134Updated this week
- Unified compiler/runtime for interfacing with PyTorch Dynamo.☆99Updated last month
- An implementation of HIP that works on CPUs, across OSes.☆115Updated last year
- ROC profiler library. Profiling with perf-counters and derived metrics.☆138Updated last week
- AMD lab notes with code examples to demonstrate use of AMD GPUs☆96Updated 9 months ago
- An extension library of WMMA API (Tensor Core API)☆93Updated 8 months ago
- The AMD rocAL is designed to efficiently decode and process images and videos from a variety of storage formats and modify them through a…☆16Updated this week
- oneAPI Level Zero Conformance & Performance test content☆48Updated this week