ROCm / hip
HIP: C++ Heterogeneous-Compute Interface for Portability
☆3,940Updated this week
Alternatives and similar repositories for hip:
Users that are interested in hip are comparing it to the libraries listed below
- AMD's Machine Intelligence Library☆1,130Updated this week
- Compiler for multiple programming models (SYCL, C++ standard parallelism, HIP/CUDA) for CPUs and GPUs from all vendors: The independent, …☆1,560Updated this week
- ArrayFire: a general purpose GPU library.☆4,654Updated this week
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl☆2,299Updated last year
- HCC is an Open Source, Optimizing C++ Compiler for Heterogeneous Compute currently for the ROCm GPU Computing Platform☆437Updated 4 years ago
- a language for fast, portable data-parallel computation☆6,008Updated this week
- AMD ROCm™ Software - GitHub Home☆5,104Updated this week
- pocl - Portable Computing Language☆970Updated this week
- [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl☆4,953Updated last year
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,735Updated last year
- HIPIFY: Convert CUDA to Portable C++ Code☆564Updated this week
- BLAS-like Library Instantiation Software Framework☆2,396Updated 3 weeks ago
- CUDA Core Compute Libraries☆1,555Updated this week
- Patterns and behaviors for GPU computing☆1,707Updated 2 years ago
- Intel® Implicit SPMD Program Compiler☆2,617Updated this week
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆2,146Updated this week
- An efficient C++17 GPU numerical computing library with Python-like syntax☆1,300Updated this week
- A C++ GPU Computing Library for OpenCL☆1,589Updated last week
- Kokkos C++ Performance Portability Programming Ecosystem: The Programming Model - Parallel Execution and Memory Abstraction☆2,144Updated this week
- Tuned OpenCL BLAS☆1,090Updated 4 months ago
- C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))☆2,333Updated this week
- Build NVIDIA® CUDA™ code for OpenCL™ 1.2 devices☆857Updated 9 months ago
- VUDA is a header-only library based on Vulkan that provides a CUDA Runtime API interface for writing GPU-accelerated applications.☆884Updated last year
- Assembler for NVIDIA Maxwell architecture☆981Updated 2 years ago
- C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM.☆2,126Updated this week
- Thin, unified, C++-flavored wrappers for the CUDA APIs☆825Updated this week
- OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.☆6,645Updated last week
- C++ tensors with broadcasting and lazy computing☆3,470Updated 3 weeks ago
- Next generation BLAS implementation for ROCm platform☆362Updated this week
- oneAPI Threading Building Blocks (oneTBB)☆6,010Updated this week