cwpearson/nvidia-performance-tools

Instructions, Docker images, and examples for Nsight Compute and Nsight Systems

☆137

Alternatives and similar repositories for nvidia-performance-tools

Users who are interested in nvidia-performance-tools compare it to the repositories listed below.

NVIDIA / nsight-training
Training material for Nsight developer tools
☆186 · Apr 27, 2026 · Updated 2 months ago
hummingtree / cuda-graph-with-dynamic-parameters
☆17 · Aug 9, 2022 · Updated 3 years ago
NVIDIA / nvtx-plugins
Python bindings for NVTX
☆67 · Jun 9, 2023 · Updated 3 years ago
gpgpu-sim / cutlass-gpgpu-sim
☆28 · Oct 26, 2019 · Updated 6 years ago
GPUPeople / GPUMemManSurvey
Evaluating different memory managers for dynamic GPU memory
☆26 · Dec 16, 2020 · Updated 5 years ago
illinois-impact / gpu-algorithms-labs
IMPACT GPU Algorithms Teaching Labs
☆60 · Apr 21, 2023 · Updated 3 years ago
leepoly / sm-profiler
☆83 · Feb 5, 2026 · Updated 5 months ago
adwaitjog / mafia
MAFIA: Multiple Application Framework for GPU architectures
☆28 · Jan 21, 2022 · Updated 4 years ago
vancemiller / CUDA-preemption
Experiments evaluating preemption on the NVIDIA Pascal architecture
☆16 · Nov 10, 2016 · Updated 9 years ago
aschuh703 / ECE408
☆54 · Dec 4, 2023 · Updated 2 years ago
mlcommons / training_results_v1.0
This repository contains the results and code for the MLPerf™ Training v1.0 benchmark.
☆36 · Feb 23, 2024 · Updated 2 years ago
knotman90 / cuStreamComp
Efficient CUDA Stream Compaction Library
☆34 · Jun 9, 2023 · Updated 3 years ago
VoVAllen / tf-dlpack
DLPack for Tensorflow
☆34 · Apr 13, 2020 · Updated 6 years ago
xiuxiazhang / KeplerAs
An Open Source Kepler GPU Assembler
☆22 · Jan 23, 2017 · Updated 9 years ago
hpcgarage / cuASR
cuASR: CUDA Algebra for Semirings
☆50 · Aug 22, 2022 · Updated 3 years ago
illinois-impact / klap
A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches
☆15 · Jun 21, 2019 · Updated 7 years ago
macsimgt / macsim-public
Simulator for Heterogeneous Architecture
☆12 · Jan 12, 2016 · Updated 10 years ago
microsoft / TileFusion
TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.
☆115 · Jun 28, 2025 · Updated last year
NVIDIA / nvbench
CUDA Kernel Benchmarking Library
☆910 · Updated this week
NVIDIA / NVTX
The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…
☆548 · Updated this week
AyakaGEMM / Hands-on-GEMM
☆156 · Mar 18, 2024 · Updated 2 years ago
rishucoding / reproduce_isca23_cpu_DLRM_inference
Sharing the codebase and steps for artifact evaluation for ISCA 2023 paper
☆16 · Feb 20, 2024 · Updated 2 years ago
sdsc / sdsc-summer-institute-2018
SDSC Summer Institute 2018 Teaching Material
☆10 · Nov 25, 2022 · Updated 3 years ago
daadaada / gas
☆49 · Dec 11, 2020 · Updated 5 years ago
NVIDIA / cnmem
A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory
☆298 · Nov 28, 2018 · Updated 7 years ago
yonsei-hpcp / pid-join
☆12 · May 8, 2025 · Updated last year
decodecudabinary / Decoding-CUDA-Binary
☆55 · Nov 21, 2019 · Updated 6 years ago
sillycross / Leiserchess---MIT-6.172-Fall16-Final-Project
A fast implementation of Leiserchess AI for MIT 6.172`16 http://scrimmage.csail.mit.edu/
☆12 · Dec 22, 2016 · Updated 9 years ago
ValeevGroup / libintx
☆25 · Nov 5, 2025 · Updated 8 months ago
yuanxinnn / APTMoE
☆13 · Jun 29, 2024 · Updated 2 years ago
volkit / volkit
Volume Manipulation Library
☆17 · Jul 13, 2023 · Updated 3 years ago
NERSC / timemory-tutorials
Tutorials for Timemory
☆21 · Aug 1, 2024 · Updated last year
PAA-NCIC / PPoPP2017_artifact
Third party assembler and GEMM library for NVIDIA Kepler GPU
☆86 · Oct 8, 2019 · Updated 6 years ago
abdallah197 / llama2-from-scratch
☆14 · Apr 26, 2024 · Updated 2 years ago
NVIDIA / multi-gpu-programming-models
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
☆909 · Sep 26, 2025 · Updated 9 months ago
sunlex0717 / DissectingTensorCores
☆114 · Apr 19, 2024 · Updated 2 years ago
ogiroux / freestanding
☆72 · Jun 23, 2020 · Updated 6 years ago
NVlabs / ptxmemorymodel
☆77 · May 29, 2019 · Updated 7 years ago
Xilinx / merlin-compiler
☆63 · Aug 4, 2023 · Updated 2 years ago