NVIDIA / nsight-vscode-editionLinks

A Visual Studio Code extension for building and debugging CUDA applications.

☆87

Alternatives and similar repositories for nsight-vscode-edition

Users that are interested in nsight-vscode-edition are comparing it to the libraries listed below

Sorting:

NVIDIA / Fuser
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
☆346Updated this week
NVIDIA / NVTX
The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…
☆434Updated this week
NVIDIA / nsight-training
Training material for Nsight developer tools
☆163Updated last year
NVIDIA / compute-sanitizer-samples
Samples demonstrating how to use the Compute Sanitizer Tools and Public API
☆85Updated last year
NVIDIA / nvbench
CUDA Kernel Benchmarking Library
☆696Updated this week
NVIDIA / multi-gpu-programming-models
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
☆768Updated 5 months ago
NVIDIA / cuda-gdb
CUDA GDB
☆210Updated 3 months ago
NVIDIA / nvbandwidth
A tool for bandwidth measurements on NVIDIA GPUs.
☆504Updated 3 months ago
PatWie / cuda-design-patterns
Some CUDA design patterns and a bit of template magic for CUDA
☆156Updated 2 years ago
NVlabs / cub
THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.
☆84Updated last year
FZJ-JSC / tutorial-multi-gpu
Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial
☆287Updated last month
NVIDIA / cudnn-frontend
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
☆600Updated this week
KernelTuner / kernel_tuner
Kernel Tuner
☆357Updated 2 weeks ago
leimao / CUDA-GEMM-Optimization
CUDA Matrix Multiplication Optimization
☆214Updated last year
ROCm / aotriton
Ahead of Time (AOT) Triton Math Library
☆75Updated this week
gpuocelot / gpuocelot
GPUOcelot: A dynamic compilation framework for PTX
☆207Updated 6 months ago
wangzyon / NVIDIA_SGEMM_PRACTICE
Step-by-step optimization of CUDA SGEMM
☆363Updated 3 years ago
ROCm / hipBLAS
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆145Updated this week
cwpearson / nvidia-performance-tools
Instructions, Docker images, and examples for Nsight Compute and Nsight Systems
☆131Updated 5 years ago
NVIDIA / cuCollections
☆566Updated this week
CUDA-Tutorial / CodeSamples
Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"
☆91Updated last year
puttsk / cuda-tutorial
A set of hands-on tutorials for CUDA programming
☆230Updated last year
pytorch-labs / triton-cpu
An experimental CPU backend for Triton (https//github.com/openai/triton)
☆43Updated 4 months ago
meta-pytorch / tritonparse
TritonParse: A Compiler Tracer, Visualizer, and mini-Reproducer(WIP) for Triton Kernels
☆139Updated this week
uxlfoundation / oneCCL
oneAPI Collective Communications Library (oneCCL)
☆241Updated this week
ROCm / Tensile
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆247Updated this week
iree-org / iree-nvgpu
☆50Updated last year
ROCm / rocm_bandwidth_test
Bandwidth test for ROCm
☆63Updated this week
intel / torch-xpu-ops
☆50Updated this week
oneapi-src / SYCLomatic
☆265Updated this week