intel / opencl-intercept-layer
Intercept Layer for Debugging and Analyzing OpenCL Applications
☆326Updated 2 weeks ago
Alternatives and similar repositories for opencl-intercept-layer:
Users that are interested in opencl-intercept-layer are comparing it to the libraries listed below
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆261Updated 2 months ago
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆223Updated 3 weeks ago
- An OpenCL device simulator and debugger☆353Updated 6 months ago
- SYCL Open Source Specification☆130Updated this week
- Collection of samples and utilities for using ComputeCpp, Codeplay's SYCL implementation☆323Updated last year
- SYCL Conformance Tests☆68Updated this week
- Intel® GPU Compute Samples☆105Updated 2 weeks ago
- The OpenCL Conformance Tests☆200Updated this week
- A tool which profiles OpenCL devices to find their peak capacities☆435Updated 2 months ago
- ☆150Updated this week
- portDNN is a library implementing neural network algorithms written using SYCL☆111Updated 10 months ago
- The OpenCL ICD Loader project.☆257Updated 2 weeks ago
- Simple OpenCL Samples that Build with Khronos Headers and Libs☆100Updated this week
- ROCm Parallel Primitives☆170Updated this week
- oneAPI Level Zero Specification Headers and Loader☆239Updated last week
- Examples for HIP☆203Updated 3 months ago
- Experimental OpenCL SPIR-V to OpenCL C translator☆25Updated last month
- Generic system-wide modern C++ for heterogeneous platforms with SYCL from Khronos Group☆441Updated 4 months ago
- ☆138Updated 2 months ago
- SYCL Benchmark Suite☆64Updated last month
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆105Updated 7 years ago
- Khronos OpenCL-CLHPP☆392Updated 2 months ago
- CLTune: An automatic OpenCL & CUDA kernel tuner☆177Updated 2 years ago
- Next generation FFT implementation for ROCm☆188Updated this week
- A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).☆527Updated last week
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime☆237Updated this week
- ☆20Updated last year
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas…☆211Updated this week
- Stretching GPU performance for GEMMs and tensor contractions.☆233Updated this week
- STREAM, for lots of devices written in many programming models☆329Updated 6 months ago