srvm/cupti_profiler

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/srvm/cupti_profiler)

srvm / cupti_profiler

CUPTI GPU Profiler

☆39

Alternatives and similar repositories for cupti_profiler

Users that are interested in cupti_profiler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sderek / CUDAAdvisor
View on GitHub
CUDAAdvisor: a GPU profiling tool
☆53Aug 24, 2018Updated 7 years ago
JohndeVostok / APE
View on GitHub
A GPU FP32 computation method with Tensor Cores.
☆27Dec 8, 2025Updated 7 months ago
SJTU-IPADS / ugache
View on GitHub
☆24Oct 31, 2023Updated 2 years ago
GVProf / GVProf
View on GitHub
GVProf: A Value Profiler for GPU-based Clusters
☆54Mar 24, 2024Updated 2 years ago
vancemiller / CUDA-preemption
View on GitHub
Experiments evaluating preemption on the NVIDIA Pascal architecture
☆16Nov 10, 2016Updated 9 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
JamesTheZ / VersaPipe
View on GitHub
A framework for pipelined computing on GPU
☆30Jul 17, 2019Updated 7 years ago
harrism / cuda_event_benchmark
View on GitHub
Unit benchmarks of CUDA event APIs.
☆17Apr 23, 2024Updated 2 years ago
prideout / camera_demo
View on GitHub
demo for par_camera_control.h
☆11Nov 22, 2022Updated 3 years ago
owensgroup / SlabAlloc
View on GitHub
A dynamic GPU memory allocator, suitable for warp synchronized scenarios.
☆11Aug 20, 2019Updated 6 years ago
gty111 / SimpleUseGpgpuSim
View on GitHub
GPGPU-SIM 使用篇
☆14Nov 12, 2022Updated 3 years ago
pfnet-research / menoh-rs
View on GitHub
Rust binding for Menoh
☆14Jan 29, 2019Updated 7 years ago
nvidia-compiler-sdk / nvvmir-samples
View on GitHub
☆74Jun 29, 2023Updated 3 years ago
yalue / cuda_scheduling_examiner_mirror
View on GitHub
A tool for examining GPU scheduling behavior.
☆96Aug 17, 2024Updated last year
hcho3 / relayviz
View on GitHub
Visualize TVM Relay program graph
☆12Nov 19, 2019Updated 6 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
vinjn / chihuahua
View on GitHub
🐶chihuahua - tiny & fast rendering library
☆13Aug 9, 2016Updated 9 years ago
Nelson-Cheung / yatsenos-riscv
View on GitHub
Rebuild YatSenOS On RISC-V 64.
☆23Jan 6, 2022Updated 4 years ago
uuudown / Tartan
View on GitHub
Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite
☆72Sep 12, 2018Updated 7 years ago
helloall1900 / pynvx
View on GitHub
Python bindings for NVIDIA CUDA APIs.
☆14Mar 2, 2024Updated 2 years ago
google / nvidia_libs_test
View on GitHub
Tests and benchmarks for cudnn (and in the future, other nvidia libraries)
☆55Nov 20, 2020Updated 5 years ago
ekondis / gpuroofperf-toolkit
View on GitHub
A GPU performance prediction toolkit for CUDA programs
☆18Mar 25, 2019Updated 7 years ago
NVlabs / NVBit
View on GitHub
☆341Apr 6, 2026Updated 3 months ago
seb-v / amd_challenge_solutions
View on GitHub
☆19Jun 6, 2025Updated last year
vinjn / GpuProf
View on GitHub
Realtime GPU Profiler for AMD / NVIDIA / Intel GPUs
☆38Aug 12, 2023Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
xnd-project / cuda-benchmarks
View on GitHub
Collection of CUDA benchmarks, with a focus on unified vs. explicit memory management.
☆21Oct 15, 2019Updated 6 years ago
zabetak / slides
View on GitHub
A repository holding the slides and short information from my presentations at different events
☆11Jul 25, 2025Updated 11 months ago
getianao / ngAP
View on GitHub
ngAP's artifact for ASPLOS'24
☆25Jul 29, 2025Updated 11 months ago
pmodels / bolt
View on GitHub
Official BOLT Repository
☆33Aug 16, 2024Updated last year
graphitemaster / 0xABAD1DEA
View on GitHub
Static global objects with constructors and destructors made useful in C++
☆29Jul 29, 2016Updated 9 years ago
KernelTuner / kernel_launcher
View on GitHub
Using C++ magic to capture CUDA kernels and tune them with Kernel Tuner
☆22Sep 12, 2025Updated 10 months ago
ariasanovsky / ptx-parser
View on GitHub
☆11Jun 9, 2023Updated 3 years ago
dholt / kvm-gpu
View on GitHub
Instructions for enabling GPU passthrough in KVM
☆15Feb 6, 2020Updated 6 years ago
tomaszkuczewski / OptixDenoiserUtils
View on GitHub
Example implementation of new Optix 7.0.0 denoiser feature
☆14Dec 11, 2019Updated 6 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
eth-cscs / Tiled-MM
View on GitHub
Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.
☆33Apr 2, 2025Updated last year
cwida / teseo
View on GitHub
A C++ library for the analysis of structural dynamic graphs
☆27Jun 14, 2022Updated 4 years ago
xindzju / vscode-awesome-snippets
View on GitHub
The most complete C/C++ snippets extension for VS Code
☆19Jun 6, 2021Updated 5 years ago
ROCm / rocprofiler-sdk
View on GitHub
[DEPRECATED] Moved to ROCm/rocm-systems repo
☆30May 28, 2026Updated last month
facebookresearch / HolisticTraceAnalysis
View on GitHub
A library to analyze PyTorch traces.
☆535May 29, 2026Updated last month
quettabit / convolution_kernel
View on GitHub
Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.
☆14Dec 8, 2017Updated 8 years ago
proyectosdeley / proyectos_de_ley
View on GitHub
Aplicación para mostrar los proyectos de ley emitidos por el Congreso
☆11Jul 26, 2020Updated 5 years ago