This material contains content on how to profile and optimize simple Pytorch mnist code using NVIDIA Nsight Systems and Pytorch Profiler
☆21Apr 23, 2026Updated last week
Alternatives and similar repositories for Profiling-AI-Software-Bootcamp
Users that are interested in Profiling-AI-Software-Bootcamp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Profiling with NVIDIA Nsight Tools Bootcamp☆23Feb 4, 2026Updated 2 months ago
- AIBench, a tool for comparing and evaluating AI serving solutions. forked from [tsbs](https://github.com/timescale/tsbs) and adapted to A…☆20Sep 4, 2024Updated last year
- Instanciate the Cache Aware Roofline Model on single socket and multisocket systems.☆27Feb 22, 2019Updated 7 years ago
- This repository contains code used to prepare the LUMIERE Glioblastoma dataset.☆35Feb 14, 2024Updated 2 years ago
- ☆27Feb 13, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Final Project for Parallel Computing at CMU (15-618/15-418)☆10May 13, 2016Updated 9 years ago
- Learning neural network embeddings in hyperbolic spaces☆14Dec 4, 2019Updated 6 years ago
- Python Tools for the POP Metrics☆13Feb 16, 2022Updated 4 years ago
- HCC Sample Applications☆13Jan 3, 2017Updated 9 years ago
- DeepSphere: a graph-based spherical CNN (TensorFlow)☆15Jan 1, 2021Updated 5 years ago
- A library for exporting models including NeMo and Hugging Face to optimized inference backends, and deploying them for efficient querying☆33Apr 23, 2026Updated last week
- SPIRO is a Smart Plate Imaging Robot☆11Feb 23, 2026Updated 2 months ago
- A tutorial to set up a running compute cluster on cloud resources☆11Jul 7, 2023Updated 2 years ago
- Simple Arm assembly kernels for testing the performance and functionality of Arm CPUs.☆16Dec 3, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- E4S for Spack☆38Nov 23, 2025Updated 5 months ago
- Julia bindings for NVTX, for instrumenting with the Nvidia Nsight Systems profiler☆39Apr 1, 2026Updated 3 weeks ago
- The CSCS ReFrame test suite☆15Updated this week
- Lecture notes of the course Analysis On Manifolds☆23Apr 2, 2026Updated 3 weeks ago
- Source code for the software implementation of SeGraM proposed in our ISCA 2022 paper: Senol Cali et. al., "SeGraM: A Universal Hardware …☆12Nov 3, 2022Updated 3 years ago
- A Deep Learning Beginner: Nvidia's End to End Learning on Steering for Self-Driving Cars☆12Apr 13, 2022Updated 4 years ago
- Ochami deployment recipes☆13Apr 23, 2026Updated last week
- Recipes for software stacks on Alps vClusters.☆15Apr 23, 2026Updated last week
- Meshcapade support for Unreal Editor for Fortnite (UEFN)☆22Apr 17, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ST-Hadoop is an open-source MapReduce extension of Hadoop designed specially to analyze your spatio-temporal data efficiently☆23Mar 1, 2019Updated 7 years ago
- gpuPairHMM: Ultra-fast GPU-based PairHMM for DNA Variant Calling☆15Nov 6, 2025Updated 5 months ago
- EPCC I/O benchmarking applications☆12Dec 15, 2021Updated 4 years ago
- JAX exponential map normalising flows on sphere☆17Oct 4, 2020Updated 5 years ago
- ☆20Oct 11, 2023Updated 2 years ago
- Modular, object-based analysis for ImageJ/Fiji☆16Mar 31, 2026Updated 3 weeks ago
- ☆57Updated this week
- SC23 Deep Learning at Scale Tutorial Material☆49Sep 16, 2024Updated last year
- ROCm Command Line Profiler - Updated moved to https://github.com/GPUOpen-Tools/RCP☆10Aug 24, 2017Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A Cocoa wrapper around Unix sockets based on NetSocket by Dustin Mierau.☆27Mar 6, 2019Updated 7 years ago
- This repository contains the results and code for the MLPerf™ Training v4.0 benchmark.☆12Jun 11, 2024Updated last year
- Scripts for fine-tuning an HPC Code LLM☆16Jul 19, 2024Updated last year
- Eastern European Machine Learning Summer School (EEML) Workshop Series 2022. Tutorial on Causality for the Serbian Machine Learning Works…☆21May 7, 2022Updated 3 years ago
- Synthetic coordinates for GNNs, as proposed in "Directional Message Passing on Molecular Graphs via Synthetic Coordinates" (NeurIPS 2021)☆32Apr 26, 2023Updated 3 years ago
- Linux Cross-Memory Attach☆23Apr 21, 2026Updated last week
- A package for magnetic field extrapolation.☆14Apr 23, 2026Updated last week