deater / performance_resultsLinks

performance results/benchmarks for a variety of machines

☆27

Alternatives and similar repositories for performance_results

Users that are interested in performance_results are comparing it to the libraries listed below

Sorting:

intel / memory-bandwidth-benchmarks
Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPU's
☆89Updated last year
ROCm / TransferBench
TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)
☆39Updated last week
ROCm / roctracer
ROCm Tracer Callback/Activity Library for Performance tracing AMD GPUs
☆83Updated last week
srvm / cupti_profiler
CUPTI GPU Profiler
☆37Updated 6 years ago
lightsighter / CudaDMA
Emulating DMA Engines on GPUs for Performance and Portability
☆40Updated 10 years ago
NUCAR-DEV / Hetero-Mark
A Benchmark Suite for Heterogeneous System Computation
☆53Updated 3 months ago
tbd-ai / tbd-suite
☆47Updated 2 years ago
mmperf / mmperf
MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.
☆131Updated last year
gpudirect / libgdsync
GPUDirect Async support for IB Verbs
☆117Updated 2 years ago
accel-sim / gpu-app-collection
A repository where GPU applications are aggregated using a common build flow that supports multiple CUDA versions.
☆65Updated last week
zoranzhao / DeepThings
A Portable C Library for Distributed CNN Inference on IoT Edge Clusters
☆82Updated 5 years ago
ROCm / rocprofiler
ROC profiler library. Profiling with perf-counters and derived metrics.
☆148Updated last week
icl-utk-edu / hpcc
HPC Challenge Benchmark
☆52Updated 2 years ago
cyanguwa / nersc-roofline
☆44Updated 4 years ago
escalab / SIMD2
☆30Updated 2 years ago
utcs-scea / ava
Automatic virtualization of (general) accelerators.
☆44Updated 2 years ago
SunsetQuest / CudaPAD
CudaPAD is a PTX/SASS viewer for NVIDIA Cuda kernels and provides an on-the-fly view of the assembly.
☆119Updated 2 years ago
tlc-pack / tlcpack
☆24Updated last year
pakmarkthub / dragon
A host-based framework that transparently extends the GPU addressable global memory space beyond the host memory using NVM-backed data po…
☆60Updated 4 years ago
ekondis / gpumembench
A GPU benchmark suite for assessing on-chip GPU memory bandwidth
☆105Updated 7 years ago
deep500 / deep500
A Deep Learning Meta-Framework and HPC Benchmarking Library
☆81Updated 3 years ago
cjg / GVirtuS
This repository is an archive. Refer to https://github.com/gvirtus/GVirtuS
☆43Updated 3 years ago
sunlex0717 / DissectingTensorCores
☆97Updated last year
ROCm / hipSPARSE
ROCm SPARSE marshalling library
☆67Updated this week
daadaada / gas
☆44Updated 4 years ago
escalab / GPTPU
GPTPU for SC 2021
☆52Updated 2 years ago
ROCm / ROCgdb
This is ROCgdb, the ROCm source-level debugger for Linux, based on GDB, the GNU source-level debugger.
☆58Updated this week
CPFL / gdev
First-Class GPU Resource Management: Device Drivers, Runtimes, and CUDA Compilers for Nouveau.
☆48Updated 7 years ago
ROCm / rocm_bandwidth_test
Bandwidth test for ROCm
☆56Updated 2 weeks ago
ROCm / MISA
Machine Intelligence Shader Autogen. AMDGPU ML shader code generator. (previously iGEMMgen)
☆34Updated last week