intel / pti-gpu
Profiling Tools Interfaces for GPU (PTI for GPU) is a set of getting-started documentation and a tools library that make it easy to begin performance analysis on Intel(R) Processor Graphics.
☆225 Updated last week
Alternatives and similar repositories for pti-gpu:
Users who are interested in pti-gpu are comparing it to the libraries listed below:
- oneAPI Level Zero Specification Headers and Loader ☆255 Updated this week
- Advanced Profiling and Analytics for AMD Hardware ☆145 Updated this week
- Intercept Layer for Debugging and Analyzing OpenCL Applications ☆326 Updated last week
- oneAPI Level Zero Conformance & Performance test content ☆48 Updated this week
- ☆236 Updated this week
- ☆20 Updated 2 years ago
- ☆20 Updated 3 months ago
- ROC profiler library. Profiling with perf-counters and derived metrics. ☆141 Updated this week
- oneAPI Collective Communications Library (oneCCL) ☆232 Updated last week
- STREAM, for lots of devices written in many programming models ☆332 Updated 7 months ago
- ☆61 Updated 3 months ago
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific… ☆141 Updated this week
- SYCL Benchmark Suite ☆64 Updated last month
- ROCm Parallel Primitives ☆171 Updated this week
- Compute Benchmarks for oneAPI Level Zero and OpenCL™ Driver ☆37 Updated this week
- oneAPI Technical Advisory Board (TAB) Meeting Notes ☆72 Updated last year
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆261 Updated 2 months ago
- Intel® GPU Compute Samples ☆107 Updated last week
- SYCL Open Source Specification ☆134 Updated this week
- ☆150 Updated 3 weeks ago
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators ☆82 Updated this week
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain. ☆132 Updated last week
- ☆83 Updated last week
- RAND library for HIP programming language ☆117 Updated this week
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime ☆241 Updated this week
- ☆138 Updated 2 months ago
- Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben A… ☆269 Updated 2 weeks ago
- Examples for HIP ☆204 Updated 4 months ago
- ☆141 Updated this week
- Stretching GPU performance for GEMMs and tensor contractions. ☆235 Updated this week