erik / cudamon
GPU monitor for CUDA devices
☆14Updated 11 years ago
Related projects ⓘ
Alternatives and complementary repositories for cudamon
- GPU Stress Test is a tool to stress the compute engine of NVIDIA Tesla GPU’s by running a BLAS matrix multiply using different data types…☆76Updated last month
- portDNN is a library implementing neural network algorithms written using SYCL☆108Updated 5 months ago
- ROC profiler library. Profiling with perf-counters and derived metrics.☆128Updated this week
- The classic STREAM benchmark, extended to measure NUMA effects.☆35Updated 5 years ago
- gmonitor is a GPU monitor (Nvidia only at the moment)☆208Updated 5 years ago
- Example of programmatic monitoring of Nvidia GPUs in C++ using NVML library☆31Updated 2 years ago
- An extension of rCUDA that enables remote-to-local GPU migration☆36Updated 8 years ago
- OpenCL memory benchmark☆13Updated 7 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆99Updated 7 years ago
- Example of how to use CUDA with CMake >= 3.8☆69Updated last year
- PyTorch -> ONNX -> TVM for autotuning☆23Updated 4 years ago
- Examples for HIP☆200Updated this week
- JPEG encoder and decoder library and console application for NVIDIA GPUs from CESNET and SITOLA of Faculty of Informatics at Masaryk Univ…☆243Updated 3 weeks ago
- Microbenchmarks and Google Benchmark library☆21Updated 3 months ago
- A 128 bit unsigned integer class for CUDA☆43Updated 3 years ago
- Simple message passing library☆22Updated 6 years ago
- clone of https://code.google.com/p/opencl-book-samples (there's an official repo here https://github.com/bgaster/opencl-book-samples)☆44Updated 11 years ago
- Learn OpenCL step by step.☆131Updated 2 years ago
- HCC Sample Applications☆13Updated 7 years ago
- ROCm SMI LIB☆123Updated this week
- Parallel Tasking Library (PTL) - Lightweight C++11 mutilthreading tasking system featuring thread-pool, task-groups, and lock-free task q…☆43Updated 3 months ago
- Implementation of a simple CNN using CUDA☆64Updated 7 years ago
- Intel® GPU Compute Samples☆98Updated 6 months ago
- The Radeon Compute Profiler (RCP) is a performance analysis tool that gathers data from the API run-time and GPU for OpenCL™ and ROCm/HSA…☆85Updated 4 years ago
- Convolutional Neural Networks☆32Updated 6 years ago
- SYCL Open Source Specification☆114Updated this week
- Examples for using SYCL on CUDA☆60Updated last week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆92Updated 2 years ago
- Intel® SHMEM - Device initiated shared memory based communication library☆18Updated last week