ROCm / HIP
HIP: C++ Heterogeneous-Compute Interface for Portability
☆3,690, updated this week
Related projects:
- ArrayFire: a general purpose GPU library. (☆4,525, updated last week)
- a language for fast, portable data-parallel computation (☆5,835, updated this week)
- [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl (☆4,907, updated 7 months ago)
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl (☆2,293, updated 7 months ago)
- Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C+… (☆1,335, updated 2 weeks ago)
- AMD's Machine Intelligence Library (☆1,049, updated this week)
- Patterns and behaviors for GPU computing (☆1,638, updated 2 years ago)
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl (☆1,669, updated 11 months ago)
- Intel® Implicit SPMD Program Compiler (☆2,473, updated this week)
- oneAPI Deep Neural Network Library (oneDNN) (☆3,579, updated this week)
- Tuned OpenCL BLAS (☆1,046, updated 3 months ago)
- A C++ GPU Computing Library for OpenCL (☆1,544, updated 3 weeks ago)
- AMD ROCm™ Software - GitHub Home (☆4,493, updated this week)
- Assembler for NVIDIA Maxwell architecture (☆940, updated last year)
- pocl - Portable Computing Language (☆911, updated this week)
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform (☆428, updated 4 years ago)
- Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices (☆840, updated 2 months ago)
- Vulkan/CUDA/HIP/OpenCL/Level Zero/Metal Fast Fourier Transform library (☆1,518, updated last week)
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). … (☆1,945, updated this week)
- oneAPI Threading Building Blocks (oneTBB) (☆5,603, updated this week)
- C++ tensors with broadcasting and lazy computing (☆3,310, updated last month)
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs (☆1,230, updated 5 months ago)
- Source code examples from the Parallel Forall Blog (☆1,223, updated last month)
- OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version. (☆6,281, updated this week)
- The cling C++ interpreter (☆3,460, updated this week)
- HIPIFY: Convert CUDA to Portable C++ Code (☆499, updated this week)
- A retargetable MLIR-based machine learning compiler and runtime toolkit. (☆2,559, updated this week)
- An efficient C++17 GPU numerical computing library with Python-like syntax (☆1,187, updated last week)
- "Multi-Level Intermediate Representation" Compiler Infrastructure (☆1,735, updated 3 years ago)
- a software library containing BLAS functions written in OpenCL (☆839, updated last month)