codeplaysoftware / portDNN
portDNN is a library implementing neural network algorithms written using SYCL
☆109Updated 8 months ago
Alternatives and similar repositories for portDNN:
Users that are interested in portDNN are comparing it to the libraries listed below
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆262Updated 2 weeks ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆104Updated 7 years ago
- Intel® GPU Compute Samples☆102Updated this week
- AMD's graph optimization engine.☆196Updated this week
- Intercept Layer for Debugging and Analyzing OpenCL Applications☆320Updated last week
- SYCL Open Source Specification☆123Updated this week
- ROCm Parallel Primitives☆169Updated this week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆93Updated 3 years ago
- SYCL Conformance Tests☆65Updated last week
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 5 years ago
- Stretching GPU performance for GEMMs and tensor contractions.☆231Updated this week
- MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.☆127Updated last year
- ☆60Updated last month
- ROCm Device Libraries☆98Updated 8 months ago
- SYCL Benchmark Suite☆60Updated 4 months ago
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆83Updated 11 months ago
- Implementation of the SYCL specification.☆67Updated 7 months ago
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation☆322Updated last year
- CLTune: An automatic OpenCL & CUDA kernel tuner☆172Updated 2 years ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆104Updated this week
- Simple OpenCL Samples that Build with Khronos Headers and Libs☆96Updated this week
- RAND library for HIP programming language☆115Updated this week
- Flexible GPGPU instrumentation☆86Updated 5 years ago
- SYCL for Vitis: Experimental fusion of triSYCL with Intel SYCL oneAPI DPC++ up-streaming effort into Clang/LLVM☆115Updated 2 months ago
- ☆147Updated last month
- An extension library of WMMA API (Tensor Core API)☆87Updated 6 months ago
- Kernel Tuning Toolkit☆56Updated 2 months ago
- A thin wrapper around miOpen and cuDNN☆40Updated last year
- ☆55Updated 2 years ago
- Next generation FFT implementation for ROCm☆185Updated this week