NVIDIA/nsight-training

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NVIDIA/nsight-training)

NVIDIA / nsight-training

Training material for Nsight developer tools

☆186

Alternatives and similar repositories for nsight-training

Users that are interested in nsight-training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cwpearson / nvidia-performance-tools
View on GitHub
Instructions, Docker images, and examples for Nsight Compute and Nsight Systems
☆137May 19, 2020Updated 6 years ago
NVIDIA / nvbench
View on GitHub
CUDA Kernel Benchmarking Library
☆901Updated this week
Jokeren / GPA
View on GitHub
GPU Performance Advisor
☆66Jul 25, 2022Updated 3 years ago
zhuzilin / pytorch-malloc
View on GitHub
An external memory allocator example for PyTorch.
☆16Aug 10, 2025Updated 11 months ago
flashinfer-ai / debug-print
View on GitHub
Debug print operator for cudagraph debugging
☆18Aug 2, 2024Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
flashinfer-ai / cutlass-viz
View on GitHub
☆65Apr 26, 2025Updated last year
RRZE-HPC / gpu-benches
View on GitHub
collection of benchmarks to measure basic GPU capabilities
☆530Oct 24, 2025Updated 8 months ago
NVIDIA / compute-sanitizer-samples
View on GitHub
Samples demonstrating how to use the Compute Sanitizer Tools and Public API
☆99Nov 6, 2023Updated 2 years ago
NVIDIA / CUDALibrarySamples
View on GitHub
CUDA Library Samples
☆2,463Updated this week
L1aoXingyu / llm-infer-bench
View on GitHub
☆12Sep 1, 2023Updated 2 years ago
NVIDIA / multi-gpu-programming-models
View on GitHub
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
☆908Sep 26, 2025Updated 9 months ago
billmuch / matmul_perf_test
View on GitHub
☆15Apr 15, 2022Updated 4 years ago
FZJ-JSC / tutorial-multi-gpu
View on GitHub
Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial
☆380Jun 26, 2026Updated 3 weeks ago
tlc-pack / cutlass_fpA_intB_gemm
View on GitHub
A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer
☆96Jun 21, 2026Updated 3 weeks ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
NCAR / HPC-Docs
View on GitHub
NCAR HPC Docs Repository
☆14Updated this week
argonne-lcf / alcl
View on GitHub
Argonne Leadership Computing Facility OpenCL tutorial
☆10Aug 22, 2025Updated 10 months ago
NVIDIA / NVTX
View on GitHub
The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…
☆547Updated this week
thallolang / thallo
View on GitHub
The Thallo DSL for Nonlinear Least Squares Optimization
☆19Aug 13, 2021Updated 4 years ago
YJMSTR / flash-linear-attention
View on GitHub
FLA but cuTile
☆27Apr 17, 2026Updated 3 months ago
kooyunmo / cuda-uvm-gpt2
View on GitHub
PyTorch-UVM on super-large language models.
☆17Dec 21, 2020Updated 5 years ago
CUDACommunity / CUDACommunityMeetup2021
View on GitHub
☆23Feb 16, 2022Updated 4 years ago
PSAL-POSTECH / accelsim_HMS
View on GitHub
☆12Jul 2, 2024Updated 2 years ago
NVIDIA / nvbandwidth
View on GitHub
A tool for bandwidth measurements on NVIDIA GPUs.
☆734Updated this week
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
NVIDIA / cuda-samples
View on GitHub
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
☆9,406May 27, 2026Updated last month
AlphaSparse / Library
View on GitHub
A sparse BLAS lib supporting multiple backends
☆51Mar 18, 2026Updated 4 months ago
lemyx / tilelang-dsa
View on GitHub
DeepSeek-V3.2-Exp DSA Warmup Lightning Indexer training operator based on tilelang
☆47Nov 19, 2025Updated 8 months ago
OpenACCUserGroup / openacc_concept_strategies_book
View on GitHub
This repository contains application codes and solutions for the Book on "OpenACC for Programmers - Concept & Strategies".
☆34Feb 19, 2019Updated 7 years ago
NVIDIA / jitify
View on GitHub
A single-header C++ library for simplifying the use of CUDA Runtime Compilation (NVRTC).
☆573Sep 15, 2025Updated 10 months ago
FluidNumerics / gpu-programming
View on GitHub
A repository of codelabs and tutorials to support education in scientific computing
☆28Dec 16, 2023Updated 2 years ago
ROCm / TransformerEngine
View on GitHub
☆72Updated this week
WangYaohuii / CXL-SSD-Sim
View on GitHub
A Full-System Simulator for CXL-Based SSD Memory System
☆45Dec 24, 2024Updated last year
openhackathons-org / Profiling-AI-Software-Bootcamp
View on GitHub
This content discusses profiling with NVIDIA®️ Nsight™️ Systems, focusing on steps to optimize a distributed data-parallelism training st…
☆26Apr 23, 2026Updated 2 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
hummingtree / cuda-graph-with-dynamic-parameters
View on GitHub
☆17Aug 9, 2022Updated 3 years ago
mit-han-lab / ncu-report-skill
View on GitHub
☆156May 24, 2026Updated last month
NVIDIA / cutlass
View on GitHub
CUDA Templates and Python DSLs for High-Performance Linear Algebra
☆10,104Updated this week
ThoenigAdrian / NeuralNetworksCudaTutorial
View on GitHub
Implement Neural Networks in Cuda from Scratch
☆23May 17, 2024Updated 2 years ago
NVIDIA / gdrcopy
View on GitHub
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
☆1,399Updated this week
KuangjuX / NVSHMEM-Tutorial
View on GitHub
NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer
☆195Feb 11, 2026Updated 5 months ago
mrnorman / miniWeatherML
View on GitHub
Exploring Machine Learning methods and workflows in a simplified weather model
☆19Jun 6, 2024Updated 2 years ago