openhackathons-org / AI-Profiler
This material shows how to profile and optimize simple PyTorch MNIST code using NVIDIA Nsight Systems and the PyTorch Profiler.
☆14 Updated 2 years ago
Alternatives and similar repositories for AI-Profiler
Users who are interested in AI-Profiler compare it to the repositories listed below.
- Material for the SC21 Deep Learning at Scale Tutorial ☆26 Updated 2 years ago
- A parallel framework for training deep neural networks ☆62 Updated 4 months ago
- ALCF Computational Performance Workshop ☆37 Updated 2 years ago
- Material for the SC22 Deep Learning at Scale Tutorial ☆41 Updated 2 years ago
- Get started with your NVIDIA Arm HPC Developers Kit! ☆33 Updated 2 years ago
- N-Ways to GPU Programming Bootcamp ☆92 Updated 9 months ago
- Machine Learning for HPC Workflows ☆137 Updated this week
- Legate Sparse is a Legate library that aims to provide a distributed and accelerated drop-in replacement for the scipy.sparse library on … ☆23 Updated 2 weeks ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆17 Updated this week
- ☆25 Updated 2 years ago
- A GPU performance prediction toolkit for CUDA programs ☆17 Updated 6 years ago
- Princeton mini course on GPUs in Python ☆40 Updated 9 months ago
- An Online Deep Learning Interface for HPC programs on NVIDIA GPUs ☆169 Updated this week
- E4S for Spack ☆33 Updated last month
- Data and reproducibility scripts for the UoB-HPC Performance Portability studies ☆17 Updated last year
- AI Training Series Material ☆37 Updated 9 months ago
- N-Ways to Multi-GPU Programming ☆37 Updated 2 years ago
- Sources for the Oak Ridge Leadership Computing Facility User Documentation ☆66 Updated last week
- Training materials provided by OpenACC.org. ☆93 Updated 11 months ago
- An Aspiring Drop-In Replacement for Pandas at Scale ☆74 Updated 3 years ago
- Cosmic Tagging Network for Neutrino Physics ☆13 Updated last year
- Python Loop Replacement with NumPy and PyTorch - Fancy Slicing, UFuncs and equivalent, Aggregations, Sorting and more ☆16 Updated 9 months ago
- ☆48 Updated last month
- The CUDA target for Numba ☆149 Updated last week
- ☆14 Updated last year
- CPU and GPU tutorial examples ☆13 Updated 3 months ago
- Lecture slides, codes and materials for both the basic and advanced "Foundations of HPC" courses @UniTS, "Data Science and Scientific Com… ☆35 Updated last year
- scikit-learn_bench benchmarks various implementations of machine learning algorithms across data analytics frameworks. It currently suppo… ☆118 Updated last month
- This is the open source version of HPL-MXP. The code performance has been verified on Frontier ☆17 Updated last week
- MLPerf™ logging library ☆37 Updated 3 months ago