NERSC / sc22-dl-tutorial
Material for the SC22 Deep Learning at Scale Tutorial
☆41Updated last year
Alternatives and similar repositories for sc22-dl-tutorial:
Users that are interested in sc22-dl-tutorial are comparing it to the libraries listed below
- SC23 Deep Learning at Scale Tutorial Material☆43Updated 7 months ago
- SC24 Deep Learning at Scale Tutorial Material☆32Updated 2 months ago
- Material for the SC21 Deep Learning at Scale Tutorial☆25Updated 2 years ago
- PyTorch examples for NERSC systems☆32Updated 5 months ago
- Collection of small examples for running on ALCF resources☆17Updated 3 weeks ago
- The ALCF hosts a regular simulation, data, and learning workshop to help users scale their applications. This repository contains the exa…☆63Updated 5 months ago
- ALCF Computational Performance Workshop☆37Updated 2 years ago
- Library for steering campaigns of simulations on supercomputers☆53Updated 2 weeks ago
- JAX bindings for the NVIDIA cuDecomp library☆32Updated 2 weeks ago
- Cosmic Tagging Network for Neutrino Physics☆13Updated 10 months ago
- ☆55Updated last year
- Reference implementations of MLPerf™ HPC training benchmarks☆47Updated 2 months ago
- scalable data movement in Exascale Supercomputers☆13Updated 2 weeks ago
- Guidelines on using Weights and Biases logging for deep learning applications on NERSC machines☆11Updated last year
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆21Updated last year
- This is a repository with examples to run inference endpoints on various ALCF clusters☆19Updated last week
- Legate Sparse is a Legate library that aims to provide a distributed and accelerated drop-in replacement for the scipy.sparse library on …☆20Updated last week
- A hands-on introduction to tuning GPU kernels using Kernel Tuner https://github.com/KernelTuner/kernel_tuner/☆30Updated 2 weeks ago
- NPBench - A Benchmarking Suite for High-Performance NumPy☆80Updated this week
- ☆21Updated 4 years ago
- SciML Benchmarking Suite for AI for Science☆39Updated 9 months ago
- This material contains content on how to profile and optimize simple Pytorch mnist code using NVIDIA Nsight Systems and Pytorch Profiler☆12Updated last year
- The JUBE benchmarking environment provides a script based framework to easily create benchmark sets, run those sets on different computer…☆37Updated 10 months ago
- ☆36Updated this week
- Stencil computations in JAX☆70Updated last year
- How to use node-local MPI rank IDs to manually map MPI ranks to GPUs☆14Updated 5 years ago
- EquiTriton is a project that seeks to implement high-performance kernels for commonly used building blocks in equivariant neural networks…☆61Updated last month
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Updated 3 weeks ago
- OpenMP Tutorial☆9Updated 3 months ago
- CPU and GPU tutorial examples☆13Updated 3 weeks ago