vgatherps / nontemporal_stores
Code used for generating charts and measurements of nontemporal stores
☆9Updated 6 years ago
Related projects
Alternatives and complementary repositories for nontemporal_stores
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆27Updated 2 months ago
- Persistent Collectives X: a collective communication library for high-performance, low-cost persistent collectives over RDMA devices.☆14Updated 5 years ago
- C++ interfaces for RDMA access☆47Updated 2 weeks ago
- Provides a set of benchmarks that can be used to measure the memory bandwidth performance of CPUs☆81Updated 7 months ago
- A low-overhead tool to periodically collect system-wide hardware performance counters on Intel64 systems.☆31Updated 2 years ago
- CUDAAdvisor: a GPU profiling tool☆48Updated 6 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs☆33Updated 5 years ago
- Memory System Microbenchmarks☆61Updated last year
- STREAMer: Benchmarking remote volatile and non-volatile memory bandwidth☆16Updated last year
- Stencil Probe - a stencil microbenchmark☆29Updated 11 years ago
- NUMA-Aware Reader-Writer Locks☆18Updated 10 years ago
- A Shared Memory Multithreaded Graph Benchmark Suite for Multicores☆34Updated 2 years ago
- A GPU FP32 computation method with Tensor Cores.☆18Updated 2 years ago
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.☆38Updated 3 years ago
- An attempt at achieving the theoretical best memory bandwidth of my machine.☆52Updated 11 years ago
- SYCL Reference Manual☆26Updated 6 months ago
- Evaluating different memory managers for dynamic GPU memory☆24Updated 3 years ago
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019☆52Updated 2 years ago
- Pointer-chasing memory benchmark (forked from Doug Pase's code).☆58Updated 10 years ago
- A library for constructing allocators and memory pools. It also contains broadly useful abstractions and utilities for memory management.…☆41Updated this week
- User-space Page Management☆104Updated 3 months ago
- This is a mirror of the official libpfm4 git repository, https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ with some local branc…☆55Updated 3 weeks ago
- InstLatX64_Demo☆41Updated last week
- A GPU-Accelerated In-Memory Key-Value Store (AWS-focused fork)☆27Updated 7 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆29Updated 2 months ago
- Slides and exercises for persistent memory programming tutorial☆11Updated 2 years ago
- A Top-Down Profiler for GPU Applications☆13Updated 8 months ago