codeplaysoftware / portBLAS
Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.
☆261Updated 3 months ago
Alternatives and similar repositories for portBLAS:
Users that are interested in portBLAS are comparing it to the libraries listed below
- portDNN is a library implementing neural network algorithms written using SYCL☆113Updated 10 months ago
- Intercept Layer for Debugging and Analyzing OpenCL Applications☆327Updated this week
- SYCL Open Source Specification☆134Updated last week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- STREAM, for lots of devices written in many programming models☆332Updated 7 months ago
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation☆323Updated last year
- SYCL Benchmark Suite☆64Updated last month
- ROCm Parallel Primitives☆171Updated this week
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group☆441Updated 5 months ago
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆225Updated last week
- Next generation FFT implementation for ROCm☆190Updated this week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆533Updated last month
- Examples for HIP☆204Updated 4 months ago
- SYCL Conformance Tests☆69Updated this week
- Next generation BLAS implementation for ROCm platform☆362Updated this week
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆104Updated 7 years ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆106Updated this week
- High-level C++ for Accelerator Clusters☆146Updated 2 weeks ago
- Next generation LAPACK implementation for ROCm platform☆99Updated this week
- ☆236Updated this week
- a software library containing Sparse functions written in OpenCL☆174Updated 5 years ago
- ☆61Updated 3 months ago
- Intel® GPU Compute Samples☆107Updated last week
- SYCL Academy, a set of learning materials for SYCL heterogeneous programming☆481Updated 2 months ago
- Stretching GPU performance for GEMMs and tensor contractions.☆235Updated this week
- Open Source Parallel STL implementation☆524Updated last year
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆392Updated 3 months ago
- Full-speed Array of Structures access☆169Updated last year
- Next generation SPARSE implementation for ROCm platform☆119Updated this week
- Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A…☆269Updated 2 weeks ago