oneapi-src / Velocity-Bench
☆45Updated this week
Alternatives and similar repositories for Velocity-Bench:
Users that are interested in Velocity-Bench are comparing it to the libraries listed below
- oneAPI Level Zero Conformance & Performance test content☆48Updated this week
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆66Updated this week
- oneAPI Technical Advisory Board (TAB) Meeting Notes☆72Updated last year
- Advanced Profiling and Analytics for AMD Hardware☆144Updated this week
- ☆20Updated 2 months ago
- ROC profiler library. Profiling with perf-counters and derived metrics.☆138Updated this week
- Compute Benchmarks for oneAPI Level Zero and OpenCL™ Driver☆37Updated 2 weeks ago
- SYCL Benchmark Suite☆64Updated last month
- Intel® SHMEM - Device initiated shared memory based communication library☆23Updated this week
- ☆83Updated this week
- Profiling Tools Interfaces for GPU (PTI for GPU) is a set of Getting Started Documentation and Tools Library to start performance analysi…☆225Updated this week
- SYCL Conformance Tests☆68Updated this week
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆51Updated last month
- rocWMMA☆105Updated this week
- Sandia OpenSHMEM is an implementation of the OpenSHMEM specification over multiple Networking APIs, including Portals 4, the Open Fabric …☆67Updated last week
- ☆61Updated 3 months ago
- A tool for generating information about the matrix multiplication instructions in AMD Radeon™ and AMD Instinct™ accelerators☆80Updated this week
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆50Updated this week
- ☆44Updated this week
- ROCm SPARSE marshalling library☆67Updated this week
- SST Structural Simulation Toolkit Parallel Discrete Event Core and Services☆141Updated this week
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆39Updated this week
- tools to create performance and roofline plots from measured data☆58Updated 10 years ago
- ☆23Updated this week
- ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs☆79Updated last week
- GPUDirect Async support for IB Verbs☆107Updated 2 years ago
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆132Updated this week
- A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.☆60Updated 3 weeks ago
- Trying to figure various CPU things out☆76Updated last year
- The University of Bristol HPC Simulation Engine☆96Updated 3 weeks ago