awreece / memory-bandwidth-demoLinks
An attempt at achieving the theoretical best memory bandwidth of my machine.
☆53Updated 12 years ago
Alternatives and similar repositories for memory-bandwidth-demo
Users that are interested in memory-bandwidth-demo are comparing it to the libraries listed below
Sorting:
- Pointer-chasing memory benchmark (forked from Doug Pase's code).☆59Updated 11 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆30Updated last year
- Parallel Memory Bandwidth Measurement / Benchmark Tool☆113Updated 3 years ago
- TLB Benchmarks☆34Updated 8 years ago
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems.☆32Updated 3 years ago
- GPUDirect Async support for IB Verbs☆132Updated 2 years ago
- Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPU's☆91Updated last year
- an API and runtime environment for data processing with MapReduce for shared-memory multi-core & multiprocessor systems.☆95Updated last year
- ☆34Updated 3 years ago
- OFI Programmer's Guide☆53Updated 2 years ago
- ☆62Updated last year
- ☆64Updated 6 years ago
- GPUfs - File system support for NVIDIA GPUs☆97Updated 6 years ago
- The classic STREAM benchmark, extended to measure NUMA effects.☆38Updated 6 years ago
- Graph500 reference implementations☆180Updated 3 years ago
- A tool for measuring the cache-coherence latencies of a processor (i.e., the latencies of loads, stores, CAS, FAI, TAS, and SWAP).☆78Updated 3 years ago
- Collection of synchronization micro-benchmarks and traces from infrastructure applications☆48Updated 3 months ago
- Measure instruction latency and throughput☆25Updated 2 months ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆80Updated 2 months ago
- The SHOC Benchmark Suite☆257Updated 3 weeks ago
- Linear algebra subroutines for large SSD-resident dense and sparse matrices☆28Updated 4 years ago
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.☆41Updated 4 years ago
- Automatic virtualization of (general) accelerators.☆44Updated 2 years ago
- Flexible GPGPU instrumentation☆88Updated 6 years ago
- ☆46Updated 8 years ago
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆66Updated 7 years ago
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations☆85Updated 2 years ago
- tools to create performance and roofline plots from measured data☆59Updated 11 years ago
- ☆141Updated 2 months ago
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆46Updated 6 years ago