codeplaysoftware / portBLAS
Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.
☆263 Updated last month
Alternatives and similar repositories for portBLAS:
Users who are interested in portBLAS are comparing it to the libraries listed below.
- portDNN is a library implementing neural network algorithms written using SYCL ☆110 Updated 8 months ago
- Intercept Layer for Debugging and Analyzing OpenCL Applications ☆322 Updated 2 weeks ago
- SYCL Open Source Specification ☆127 Updated last week
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation ☆323 Updated last year
- ROCm Thrust - run Thrust dependent software on AMD GPUs ☆105 Updated this week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆93 Updated 3 years ago
- ROCm Parallel Primitives ☆169 Updated this week
- SYCL Conformance Tests ☆67 Updated last week
- SYCL Benchmark Suite ☆61 Updated last week
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group ☆442 Updated 3 months ago
- RAND library for HIP programming language ☆115 Updated this week
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi… ☆217 Updated this week
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth ☆104 Updated 7 years ago
- SYCL Academy, a set of learning materials for SYCL heterogeneous programming ☆473 Updated 3 weeks ago
- Stretching GPU performance for GEMMs and tensor contractions. ☆233 Updated this week
- CLTune: An automatic OpenCL & CUDA kernel tuner ☆173 Updated 2 years ago
- Next generation FFT implementation for ROCm ☆188 Updated this week
- Reusable software components for ROCm developers ☆81 Updated this week
- ROCm BLAS marshalling library ☆131 Updated this week
- Simple OpenCL Samples that Build with Khronos Headers and Libs ☆97 Updated last week
- Examples for using SYCL on CUDA ☆60 Updated 2 weeks ago
- High-level C++ for Accelerator Clusters ☆144 Updated 2 weeks ago
- Implementation of the SYCL specification. ☆67 Updated 8 months ago
- Advanced Profiling and Analytics for AMD Hardware ☆140 Updated this week
- Next generation LAPACK implementation for ROCm platform ☆98 Updated this week
- Examples for HIP ☆202 Updated 2 months ago