ezyang / nvprof2jsonLinks

Convert nvprof profiles into about:tracing compatible JSON files

☆70

Alternatives and similar repositories for nvprof2json

Users that are interested in nvprof2json are comparing it to the libraries listed below

Sorting:

NVIDIA / nvtx-plugins
Python bindings for NVTX
☆66Updated 2 years ago
dmlc / nnvm-fusion
Kernel Fusion and Runtime Compilation Based on NNVM
☆70Updated 8 years ago
pytorch / tensorpipe
A tensor-aware point-to-point communication primitive for machine learning
☆259Updated 2 years ago
daadaada / turingas
Assembler for NVIDIA Volta and Turing GPUs
☆226Updated 3 years ago
bshillingford / python-cuda-profile
☆34Updated 8 years ago
parasj / checkmate
Training neural networks in TensorFlow 2.0 with 5x less memory
☆132Updated 3 years ago
google-research / sputnik
A library of GPU kernels for sparse matrix operations.
☆270Updated 4 years ago
bwasti / pytorch_compiler_tutorial
Codebase associated with the PyTorch compiler tutorial
☆46Updated 5 years ago
albanD / subclass_zoo
☆171Updated last year
Funatiq / gossip
gossip: Efficient Communication Primitives for Multi-GPU Systems
☆59Updated 3 years ago
deep500 / deep500
A Deep Learning Meta-Framework and HPC Benchmarking Library
☆81Updated 3 years ago
tensorflow / mlir-hlo
☆420Updated this week
awslabs / raf
☆145Updated 6 months ago
jiazhihao / metaflow_sysml19
Repository for SysML19 Artifacts Evaluation
☆54Updated 6 years ago
cmu-catalyst / collage
System for automated integration of deep learning backends.
☆47Updated 2 years ago
intel / torch-ccl
oneCCL Bindings for Pytorch*
☆99Updated 3 weeks ago
tqchen / ffi-navigator
☆241Updated this week
mmperf / mmperf
MatMul Performance Benchmarks for a Single CPU Core comparing both hand engineered and codegen kernels.
☆134Updated last year
jansel / pytorch-jit-paritybench
☆40Updated 7 months ago
NVIDIA / Fuser
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
☆345Updated this week
openucx / torch-ucc
pytorch ucc plugin
☆23Updated 4 years ago
henline / streamexecutordoc
Documentation for StreamExecutor open source proposal
☆83Updated 9 years ago
openxla / shardy
MLIR-based partitioning system
☆114Updated this week
tlc-pack / relax
☆196Updated 2 years ago
VoVAllen / tf-dlpack
DLPack for Tensorflow
☆35Updated 5 years ago
tbd-ai / tbd-suite
☆47Updated 2 years ago
linnanwang / superneurons-release
this is the release repository of superneurons
☆52Updated 4 years ago
awslabs / lorien
☆43Updated last year
NVlabs / NVBit
☆270Updated last month
xmartlabs / cuda-calculator
Online CUDA Occupancy Calculator
☆79Updated 3 years ago