ROCm / hipify_torch
☆17Updated last month
Related projects ⓘ
Alternatives and complementary repositories for hipify_torch
- rocWMMA☆92Updated this week
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆75Updated last week
- CMake modules used within the ROCm libraries☆58Updated this week
- AMD’s C++ library for accelerating tensor primitives☆35Updated this week
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆43Updated 10 months ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆35Updated 6 months ago
- Random number library that generate pseudo-random and quasi-random numbers.☆24Updated this week
- AMD's graph optimization engine.☆186Updated this week
- hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditiona…☆63Updated this week
- hipFFT is a FFT marshalling library.☆54Updated this week
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆36Updated this week
- HIPCC: HIP compiler driver☆40Updated 6 months ago
- SYCL Conformance Tests☆62Updated this week
- ☆14Updated last week
- Thrust, CUB, TBB, AVX2, CUDA, OpenCL, OpenMP, SyCL - all it takes to sum a lot of numbers fast!☆73Updated 6 months ago
- Bandwidth test for ROCm☆49Updated this week
- ☆103Updated this week
- Onboarding documentation source for the AMD Ryzen™ AI Software Platform. The AMD Ryzen™ AI Software Platform enables developers to take…☆47Updated last week
- Unified compiler/runtime for interfacing with PyTorch Dynamo.☆95Updated this week
- GPGMM, a General-Purpose GPU Memory Management Library.☆32Updated 9 months ago
- ☆88Updated last week
- SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi☆37Updated last year
- A thin wrapper around miOpen and cuDNN☆38Updated last year
- ROCm Device Libraries☆98Updated 6 months ago
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆105Updated 3 months ago
- RAND library for HIP programming language☆111Updated this week
- Fork of LLVM to support AMD AIEngine processors☆106Updated this week
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆68Updated 10 months ago
- ☆128Updated this week
- A High-Throughput Parallel Lossless Compressor for Scientific Data☆61Updated last year