codeplaysoftware / portBLAS
Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.
☆261 Updated 3 months ago
Alternatives and similar repositories for portBLAS:
Users who are interested in portBLAS are comparing it to the libraries listed below.
- portDNN is a library implementing neural network algorithms written using SYCL ☆113 Updated 11 months ago
- SYCL Open Source Specification ☆134 Updated last week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆93 Updated 3 years ago
- STREAM, for lots of devices written in many programming models ☆334 Updated 8 months ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group ☆441 Updated 6 months ago
- Intercept Layer for Debugging and Analyzing OpenCL Applications ☆328 Updated this week
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation ☆323 Updated last year
- SYCL Benchmark Suite ☆64 Updated 2 months ago
- ROCm Parallel Primitives ☆171 Updated last week
- ☆239 Updated this week
- An implementation of HIP that works on CPUs, across OSes. ☆116 Updated last year
- Next generation LAPACK implementation for ROCm platform ☆100 Updated this week
- High-level C++ for Accelerator Clusters ☆145 Updated last week
- ☆251 Updated this week
- Next generation SPARSE implementation for ROCm platform ☆121 Updated this week
- Abstraction Library for Parallel Kernel Acceleration ☆378 Updated last month
- SYCL Conformance Tests ☆70 Updated this week
- Stretching GPU performance for GEMMs and tensor contractions. ☆237 Updated last week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC). ☆533 Updated last month
- ROCm Thrust - run Thrust dependent software on AMD GPUs ☆108 Updated this week
- Next generation FFT implementation for ROCm ☆191 Updated this week
- CLTune: An automatic OpenCL & CUDA kernel tuner ☆178 Updated 2 years ago
- Kernel Tuning Toolkit ☆59 Updated last month
- Open Source Parallel STL implementation ☆526 Updated last year
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi… ☆225 Updated last month
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas… ☆219 Updated this week
- Advanced Profiling and Analytics for AMD Hardware ☆152 Updated this week
- Next generation BLAS implementation for ROCm platform ☆367 Updated this week
- RAND library for HIP programming language ☆118 Updated this week
- Examples for HIP ☆205 Updated 5 months ago