Collection of CUDA benchmarks, with a focus on unified vs. explicit memory management.
☆20Oct 15, 2019Updated 6 years ago
Alternatives and similar repositories for cuda-benchmarks
Users that are interested in cuda-benchmarks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Unit benchmarks of CUDA event APIs.☆17Apr 23, 2024Updated last year
- nvidia TensorRT SSD implementation☆16May 15, 2018Updated 7 years ago
- flexible-gemm conv of deepcore☆17Dec 2, 2019Updated 6 years ago
- ☆12Oct 25, 2022Updated 3 years ago
- Rebuild YatSenOS On RISC-V 64.☆23Jan 6, 2022Updated 4 years ago
- A GPU FP32 computation method with Tensor Cores.☆26Dec 8, 2025Updated 3 months ago
- A library for hyperspectral image analysis using scikit-learn.☆10Apr 1, 2021Updated 4 years ago
- ☆12Feb 7, 2018Updated 8 years ago
- ☆12Jun 3, 2019Updated 6 years ago
- HPC Game Platform☆11Apr 20, 2023Updated 2 years ago
- D BLAS header. Works with OpenBLAS.☆13Mar 20, 2023Updated 3 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Apr 2, 2025Updated 11 months ago
- CUDA tool set for non-C++ languages that provides similar functionality like Thrust, with NVRTC at its core.☆59Aug 13, 2022Updated 3 years ago
- ☆11Jun 9, 2023Updated 2 years ago
- Offline as of 2026-03-13☆15Mar 13, 2026Updated last week
- A C++ allocator based on cudaMallocManaged☆23Nov 19, 2018Updated 7 years ago
- Experiments from our work Uncertainty Quantification and Deep Ensemble☆10Nov 1, 2021Updated 4 years ago
- JNIEasy - Java Native Objects based on JNI☆10Aug 30, 2023Updated 2 years ago
- A Cinder apps that utilizes a deferred rendering engine to render lights and SSAO. There is also point-light shadow-mapping.☆32Jun 1, 2015Updated 10 years ago
- ☆25Jun 24, 2022Updated 3 years ago
- cuASR: CUDA Algebra for Semirings☆45Aug 22, 2022Updated 3 years ago
- outline and links for PLDI 2022 tutorial☆17Jun 13, 2022Updated 3 years ago
- SYSU-ARCH is a LAB that focuses on the use and extending of simulators.☆10Dec 19, 2022Updated 3 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆109Aug 12, 2017Updated 8 years ago
- Simple starter CMake project that uses NVBench.☆16May 6, 2025Updated 10 months ago
- Surfel-based Mapping for 3d Laser Range Data (SuMa)☆12Mar 6, 2019Updated 7 years ago
- HTML/JS port of CUDA Occupancy Calculator☆17Nov 23, 2021Updated 4 years ago
- Benchmarking module for OCaml☆33Jan 31, 2025Updated last year
- A Netty DNS server. The original project, werkzeugkasten, can be found here http://code.google.com/p/werkzeugkasten/. This repo offers a …☆19Sep 16, 2017Updated 8 years ago
- ☆25Nov 14, 2023Updated 2 years ago
- Automatic virtualization of (general) accelerators.☆47Nov 28, 2022Updated 3 years ago
- GEMM and Winograd based convolutions using CUTLASS☆28Jul 15, 2020Updated 5 years ago
- High-Performance Linpack Benchmark adopted version for GPU backend☆12Sep 12, 2022Updated 3 years ago
- ☆14Dec 23, 2025Updated 3 months ago
- This app for all kind of movie information. It's have everything about movie. I used The movie database API for all information . And I u…☆10Oct 22, 2020Updated 5 years ago
- Framework for simulating deficiencies and other aspects of the human visual system