codeplaysoftware / portDNN
portDNN is a library implementing neural network algorithms written using SYCL
☆108Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for portDNN
- An implementation of BLAS using the SYCL open standard.☆259Updated 2 weeks ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆99Updated 7 years ago
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆64Updated 4 years ago
- Intercept Layer for Debugging and Analyzing OpenCL Applications☆314Updated 2 weeks ago
- AMD's graph optimization engine.☆186Updated this week
- Stretching GPU performance for GEMMs and tensor contractions.☆223Updated this week
- SYCL Open Source Specification☆116Updated this week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆92Updated 2 years ago
- Intel® GPU Compute Samples☆97Updated this week
- ☆55Updated last year
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆83Updated 8 months ago
- SYCL Benchmark Suite☆56Updated 2 months ago
- ☆59Updated this week
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆99Updated this week
- Implementation of the SYCL specification.☆67Updated 5 months ago
- ROCm Device Libraries☆98Updated 6 months ago
- ROCm Parallel Primitives☆162Updated this week
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation☆322Updated last year
- SYCL Conformance Tests☆62Updated this week
- CLTune: An automatic OpenCL & CUDA kernel tuner☆170Updated last year
- Kernel Tuning Toolkit☆55Updated 2 weeks ago
- Examples for using SYCL on CUDA☆60Updated 2 weeks ago
- SYCL for Vitis: Experimental fusion of triSYCL with Intel SYCL oneAPI DPC++ up-streaming effort into Clang/LLVM☆110Updated 2 weeks ago
- Simple OpenCL Samples that Build with Khronos Headers and Libs☆88Updated last week
- MIOpenGEMM is now deprecated☆61Updated last year
- ☆67Updated 2 years ago
- Conversions to MLIR EmitC☆124Updated 2 months ago
- RAND library for HIP programming language☆111Updated this week
- Full-speed Array of Structures access☆161Updated last year
- Flexible GPGPU instrumentation☆86Updated 5 years ago