Convert nvprof profiles into about:tracing compatible JSON files
☆73Apr 9, 2021Updated 5 years ago
Alternatives and similar repositories for nvprof2json
Users that are interested in nvprof2json are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- pytorch ucc plugin☆23Jul 8, 2021Updated 4 years ago
- Prototype routines for GPU quantization written using PyTorch.☆21Apr 15, 2026Updated 2 months ago
- Torch FFI-bindings for NNPACK☆31May 26, 2017Updated 9 years ago
- MaskedTensors for PyTorch☆40Jul 17, 2022Updated 3 years ago
- A tracing JIT compiler for PyTorch☆14Dec 11, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆17Dec 9, 2022Updated 3 years ago
- Strassen's Algorithm for Tensor Contraction☆15Jul 7, 2017Updated 8 years ago
- ROCm Machine Learning and HPC Stack installer☆30Jul 31, 2020Updated 5 years ago
- ☆21Mar 3, 2025Updated last year
- ☆27Oct 26, 2019Updated 6 years ago
- Playground for some RNN stuff in Torch.☆21Aug 12, 2015Updated 10 years ago
- OpenCL tool to detect buffer overflows in GPU kernels☆23Jan 7, 2019Updated 7 years ago
- TORCH_TRACE parser for PT2☆86May 11, 2026Updated last month
- ☆12Oct 19, 2014Updated 11 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ext_mpi_collectives☆11Jun 3, 2026Updated 2 weeks ago
- An Open-Source Community Supported Fortran layer for AMD HIP☆10May 20, 2020Updated 6 years ago
- Hacks for PyTorch☆19Apr 18, 2023Updated 3 years ago
- NCCL Fast Socket is a transport layer plugin to improve NCCL collective communication performance on Google Cloud.☆124Nov 15, 2023Updated 2 years ago
- A repository holding the slides and short information from my presentations at different events☆11Jul 25, 2025Updated 10 months ago
- metaprogramming for Julia arrays☆13Sep 26, 2020Updated 5 years ago
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆154May 28, 2026Updated 2 weeks ago
- ☆13Oct 23, 2018Updated 7 years ago
- A port of shadowsocks via websockets protocol.☆10Feb 1, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Zsh patched to support Actually Portable Executables git://git.code.sf.net/p/zsh/code (upstream pending)☆16Jan 26, 2021Updated 5 years ago
- La plataforma de código abierto para la gestión de reportes ciudadanos.☆19Jul 18, 2017Updated 8 years ago
- GPUDirect Async implementation of HPGMG-FV CUDA☆11May 11, 2018Updated 8 years ago
- mount hfs (HFS, HFS+, HFSX) usb/sd on iOS without iFile☆11Sep 9, 2015Updated 10 years ago
- Intercepting CUDA runtime calls with LD_PRELOAD☆43Mar 11, 2014Updated 12 years ago
- Notes and toy codes...☆11Jul 5, 2019Updated 6 years ago
- Collective communications library with various primitives for multi-machine training.☆1,430Updated this week
- (DEPRECATED, migrated to main repo - hasktorch/hasktorch) Research code generation / FFI binding using libtorch 1.x for the next Hasktor…☆11Sep 13, 2019Updated 6 years ago
- PyTorch bindings for CUTLASS grouped GEMM.☆153May 29, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- This version of Chombo is fortran-free and depends on the Proto middleware infrastructure for performance portability.☆10Sep 12, 2025Updated 9 months ago
- Kingul: Korean Keyboard for Kindle E-readers☆14Jan 26, 2025Updated last year
- Hikaru no Go, GBA translation☆14Sep 15, 2017Updated 8 years ago
- ⛔️ DEPRECATED - System for AUtomated Code Evaluation☆26Jun 18, 2020Updated 5 years ago
- ☆10Jul 16, 2016Updated 9 years ago
- A small mod board for the GDP Pocket to add a tiny internal USB hub for various purposes.☆13Oct 10, 2017Updated 8 years ago
- Drop-in library for tracking the memory allocations of CUDA applications☆14Nov 17, 2017Updated 8 years ago