codeplaysoftware / portBLAS
Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.
☆261 Updated 4 months ago
Alternatives and similar repositories for portBLAS
Users who are interested in portBLAS are comparing it to the libraries listed below.
- portDNN is a library implementing neural network algorithms written using SYCL ☆113 Updated last year
- Intercept Layer for Debugging and Analyzing OpenCL Applications ☆328 Updated this week
- SYCL Open Source Specification ☆136 Updated last week
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation ☆324 Updated last year
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆93 Updated 3 years ago
- STREAM, for lots of devices written in many programming models ☆339 Updated 9 months ago
- Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A… ☆271 Updated 2 months ago
- ROCm Parallel Primitives ☆172 Updated this week
- SYCL Benchmark Suite ☆64 Updated 3 months ago
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group ☆443 Updated 7 months ago
- ☆245 Updated this week
- ROCm Thrust - run Thrust dependent software on AMD GPUs ☆119 Updated this week
- SYCL Conformance Tests ☆71 Updated last week
- Examples for HIP ☆207 Updated 5 months ago
- Next generation LAPACK implementation for ROCm platform ☆101 Updated this week
- Full-speed Array of Structures access ☆169 Updated 2 years ago
- Next generation FFT implementation for ROCm ☆196 Updated this week
- ☆256 Updated this week
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi… ☆226 Updated last week
- High-level C++ for Accelerator Clusters ☆145 Updated last week
- CUDA kernel author's tools ☆111 Updated 3 years ago
- Simple OpenCL Samples that Build with Khronos Headers and Libs ☆105 Updated 3 weeks ago
- RAND library for HIP programming language ☆120 Updated this week
- Stretching GPU performance for GEMMs and tensor contractions. ☆241 Updated this week
- An implementation of HIP that works on CPUs, across OSes. ☆119 Updated last year
- Implementation of the SYCL specification. ☆66 Updated 11 months ago
- Next generation BLAS implementation for ROCm platform ☆377 Updated last week
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth ☆105 Updated 7 years ago
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas… ☆220 Updated this week
- CLTune: An automatic OpenCL & CUDA kernel tuner ☆178 Updated 2 years ago