awreece / memory-bandwidth-demoLinks
An attempt at achieving the theoretical best memory bandwidth of my machine.
☆53Updated 12 years ago
Alternatives and similar repositories for memory-bandwidth-demo
Users that are interested in memory-bandwidth-demo are comparing it to the libraries listed below
Sorting:
- TLB Benchmarks☆34Updated 8 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆30Updated last year
- Parallel Memory Bandwidth Measurement / Benchmark Tool☆113Updated 3 years ago
- ☆65Updated 6 years ago
- Pointer-chasing memory benchmark (forked from Doug Pase's code).☆59Updated 11 years ago
- GPUDirect Async support for IB Verbs☆133Updated 3 years ago
- Graph500 reference implementations☆181Updated 3 years ago
- Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPU's☆91Updated last year
- ☆63Updated last year
- GPUfs - File system support for NVIDIA GPUs☆98Updated 6 years ago
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.☆42Updated 4 years ago
- Chai☆45Updated last week
- Blaze runtime system that support efficient accelerator integration for big data.☆24Updated 8 years ago
- Utilities to measure read access times of caches, memory, and hardware prefetches for simple and fused operations☆85Updated 2 years ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆80Updated 3 months ago
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems.☆32Updated 3 years ago
- an API and runtime environment for data processing with MapReduce for shared-memory multi-core & multiprocessor systems.☆95Updated last year
- Instanciate the Cache Aware Roofline Model on single socket and multisocket systems.☆27Updated 6 years ago
- A framework for pipelined computing on GPU☆30Updated 6 years ago
- OFI Programmer's Guide☆52Updated 2 years ago
- tools to create performance and roofline plots from measured data☆60Updated 11 years ago
- ☆34Updated 3 years ago
- A Shared Memory Multithreaded Graph Benchmark Suite for Multicores☆36Updated 5 months ago
- A kernel module to enable RDMA transfers to/from IO, PFN and DAX mapped memory☆10Updated 10 years ago
- A host-based framework that transparently extends the GPU addressable global memory space beyond the host memory using NVM-backed data po…☆61Updated 5 years ago
- Automatic virtualization of (general) accelerators.☆45Updated 2 years ago
- The OpenDwarfs project provides a benchmark suite consisting of different computation/communication idioms, i.e., dwarfs, for state-of-ar…☆98Updated 6 years ago
- Memory System Microbenchmarks☆64Updated 2 years ago
- CUPTI GPU Profiler☆40Updated 6 years ago
- A Benchmark Suite for Heterogeneous System Computation☆54Updated 9 months ago