harrism / cuda_event_benchmark
Unit benchmarks of CUDA event APIs.
☆17 Updated 9 months ago
Alternatives and similar repositories for cuda_event_benchmark:
Users who are interested in cuda_event_benchmark are comparing it to the libraries listed below:
- Generate simple index ranges in C++ and CUDA C++ ☆39 Updated last year
- A task benchmark ☆41 Updated 6 months ago
- Official BOLT Repository ☆28 Updated 6 months ago
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis. ☆50 Updated last year
- A fast and accurate reuse distance analyzer for multi-threaded applications. It leverages existing hardware features in commodity CPUs. ☆16 Updated 2 years ago
- ☆24 Updated this week
- ☆68 Updated 4 years ago
- mallocMC: Memory Allocator for Many Core Architectures ☆54 Updated last week
- ☆30 Updated last week
- A library to benchmark CUDA code, similar to google benchmark. ☆28 Updated 3 years ago
- A GPU FP32 computation method with Tensor Cores. ☆20 Updated 2 years ago
- Directed Acyclic Graph Execution Engine (DAGEE) is a C++ library that enables programmers to express computation and data movement, as ta… ☆45 Updated 3 years ago
- Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019