codeplaysoftware / portDNN
portDNN is a library implementing neural network algorithms written using SYCL
☆111 Updated 10 months ago
Alternatives and similar repositories for portDNN:
Users who are interested in portDNN are comparing it to the libraries listed below.
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆261 Updated 2 months ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆93 Updated 3 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth ☆104 Updated 7 years ago
- SYCL Open Source Specification ☆131 Updated this week
- SYCL Conformance Tests ☆68 Updated this week
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL. ☆66 Updated 5 years ago
- Stretching GPU performance for GEMMs and tensor contractions. ☆234 Updated 2 weeks ago
- Intel® GPU Compute Samples ☆105 Updated last week
- ROCm Parallel Primitives ☆171 Updated last week
- Intercept Layer for Debugging and Analyzing OpenCL Applications ☆327 Updated this week
- RAND library for HIP programming language ☆117 Updated last week
- SYCL Benchmark Suite ☆64 Updated last month
- ☆34 Updated last year
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation ☆323 Updated last year
- ROCm Device Libraries ☆97 Updated 10 months ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs ☆106 Updated last week
- ☆61 Updated 3 months ago
- Reusable software components for ROCm developers ☆83 Updated last week
- Implementation of the SYCL specification. ☆66 Updated 9 months ago
- ☆55 Updated 2 years ago
- Flexible GPGPU instrumentation ☆86 Updated 5 years ago
- AMD's graph optimization engine. ☆213 Updated this week
- ☆20 Updated 2 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner ☆177 Updated 2 years ago
- SYCL for Vitis: Experimental fusion of triSYCL with Intel SYCL oneAPI DPC++ up-streaming effort into Clang/LLVM ☆116 Updated 4 months ago
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators ☆80 Updated this week
- Simple OpenCL Samples that Build with Khronos Headers and Libs ☆99 Updated last week
- Next generation SPARSE implementation for ROCm platform ☆119 Updated this week
- ☆150 Updated 2 weeks ago
- Examples for using SYCL on CUDA ☆62 Updated last month