Collection of CUDA benchmarks, with a focus on unified vs. explicit memory management.
☆20Oct 15, 2019Updated 6 years ago
Alternatives and similar repositories for cuda-benchmarks
Users that are interested in cuda-benchmarks are comparing it to the libraries listed below
Sorting:
- nvidia TensorRT SSD implementation☆16May 15, 2018Updated 7 years ago
- Unit benchmarks of CUDA event APIs.☆17Apr 23, 2024Updated last year
- Rebuild YatSenOS On RISC-V 64.☆22Jan 6, 2022Updated 4 years ago
- Using C++ magic to capture CUDA kernels and tune them with Kernel Tuner☆21Sep 12, 2025Updated 5 months ago
- A GPU FP32 computation method with Tensor Cores.☆26Dec 8, 2025Updated 2 months ago
- ☆24Jun 24, 2022Updated 3 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Apr 2, 2025Updated 11 months ago
- ☆24Nov 14, 2023Updated 2 years ago
- ☆29Aug 13, 2019Updated 6 years ago
- cuASR: CUDA Algebra for Semirings☆45Aug 22, 2022Updated 3 years ago
- ☆30May 30, 2018Updated 7 years ago
- Gazebo simulation for CIR-KIT-Unit03☆11Jul 4, 2017Updated 8 years ago
- A parser for PTX 6.5☆13Jun 19, 2023Updated 2 years ago
- Android Face Recognition uses Microsoft Project Oxford Face API for face detection and identification.☆13Nov 13, 2015Updated 10 years ago
- It's a project combined with hardware and software, the goal is to make a smart watch based on esp8266 chip. The smart watch has so many …☆10Jul 9, 2019Updated 6 years ago
- A library for hyperspectral image analysis using scikit-learn.☆10Apr 1, 2021Updated 4 years ago
- Holoplay.js compatible server for Linux (and probably other OSes)☆10Mar 27, 2019Updated 6 years ago
- Automatic virtualization of (general) accelerators.☆47Nov 28, 2022Updated 3 years ago
- A tool for examining GPU scheduling behavior.☆95Aug 17, 2024Updated last year
- Materials for the 2017 QMSS Python Workshop☆12Jun 22, 2017Updated 8 years ago
- This repository contains a YoloV4/Darknet based image classifier coded to run onboard the Nvidia Jetson Nano platform at approximately 10…☆13Aug 17, 2021Updated 4 years ago
- Experiments from our work Uncertainty Quantification and Deep Ensemble☆10Nov 1, 2021Updated 4 years ago
- ☆11Jun 9, 2023Updated 2 years ago
- ROS Package to estimate the variance of the inertial data from an IMU to be used to populate the error covariance matrix☆10Dec 13, 2015Updated 10 years ago
- ☆11Aug 21, 2023Updated 2 years ago
- OpenPose CNN model compatible with Huawei Ascend Atlas 200DK☆10Jun 23, 2019Updated 6 years ago
- A* with Artificial Terrain Cost for Search Space Restriction - ROS move_base integrated plugin☆13Jul 20, 2020Updated 5 years ago
- JNIEasy - Java Native Objects based on JNI☆10Aug 30, 2023Updated 2 years ago
- ☆11Sep 25, 2021Updated 4 years ago
- ROS node / nodelet for GenICam cameras☆11Sep 3, 2022Updated 3 years ago
- ☆10Jan 19, 2020Updated 6 years ago
- This is material to complement the FutureLearn MOOC on "Defensive programming and debugging", as well as for training purposes.☆12May 8, 2025Updated 9 months ago
- A lightweight and accurate point cloud clustering method☆11May 20, 2020Updated 5 years ago
- Yat another MySQL storage engine, a database course project.☆13Dec 23, 2022Updated 3 years ago
- Enables Jetson to be controlled with handpose using trt_pose☆12Mar 16, 2021Updated 4 years ago
- ROS wrappers for the V4R library☆10Oct 3, 2017Updated 8 years ago
- The Open GPU Server for CI purpose.☆15Feb 16, 2026Updated 2 weeks ago
- Correlated Low-rank Structure (CoLR) for Federated Recommendation System☆12May 22, 2024Updated last year
- An MLIR-based compiler from C/C++ to AMD-Xilinx Versal AIE☆17Aug 5, 2022Updated 3 years ago