deater / performance_results
performance results/benchmarks for a variety of machines
☆27Updated 2 months ago
Alternatives and similar repositories for performance_results:
Users that are interested in performance_results are comparing it to the libraries listed below
- GPUDirect example☆59Updated 3 years ago
- GPUDirect Async support for IB Verbs☆111Updated 2 years ago
- Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPU's☆89Updated last year
- CUPTI GPU Profiler☆37Updated 6 years ago
- NCCL Examples from Official NVIDIA NCCL Developer Guide.☆17Updated 6 years ago
- Automated machine learning as an AI-HPC benchmark☆66Updated 2 years ago
- Examples showing how to utilize the NVML library for GPU monitoring☆28Updated 2 years ago
- Forked from https://bitbucket.org/berkeleylab/cs-roofline-toolkit/src/master/☆19Updated 5 years ago
- Automatic virtualization of (general) accelerators.☆43Updated 2 years ago
- ☆25Updated 5 years ago
- EEMBC's Machine-Learning Inference Benchmark targeted at edge devices.☆46Updated 3 years ago
- A Portable C Library for Distributed CNN Inference on IoT Edge Clusters☆83Updated 5 years ago
- CUDA for MNIST training/inference☆40Updated last year
- Magnum IO community repo☆90Updated 3 months ago
- ☆13Updated 10 years ago
- RCCL Performance Benchmark Tests☆64Updated this week
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆39Updated 2 weeks ago
- example code for using DC QP for providing RDMA READ and WRITE operations to remote GPU memory☆128Updated 8 months ago
- oneAPI Collective Communications Library (oneCCL)☆232Updated this week
- A tool for examining GPU scheduling behavior.☆81Updated 8 months ago
- 🔮 Execution time predictions for deep neural network training iterations across different GPUs.☆61Updated 2 years ago
- ☆47Updated 2 years ago
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆64Updated 6 years ago
- A highly scalable framework for the performance and energy monitoring of HPC servers☆17Updated 3 weeks ago
- An Efficient Dynamic Resource Scheduler for Deep Learning Clusters☆42Updated 7 years ago
- HPC Challenge Benchmark☆52Updated 2 years ago
- GPTPU for SC 2021☆51Updated 2 years ago
- ROCm Driver RDMA Peer to Peer Support☆21Updated 6 years ago
- First-Class GPU Resource Management: Device Drivers, Runtimes, and CUDA Compilers for Nouveau.☆47Updated 7 years ago
- OpenSHMEM Reference Implementation over UCX for Specification 1.4 and up☆35Updated last year