NERSC / sc22-dl-tutorialLinks
Material for the SC22 Deep Learning at Scale Tutorial
☆41Updated last year
Alternatives and similar repositories for sc22-dl-tutorial
Users that are interested in sc22-dl-tutorial are comparing it to the libraries listed below
Sorting:
- Material for the SC21 Deep Learning at Scale Tutorial☆25Updated 2 years ago
- SC24 Deep Learning at Scale Tutorial Material☆32Updated 4 months ago
- ALCF Computational Performance Workshop☆37Updated 2 years ago
- SC23 Deep Learning at Scale Tutorial Material☆45Updated 8 months ago
- Collection of small examples for running on ALCF resources☆19Updated last week
- The ALCF hosts a regular simulation, data, and learning workshop to help users scale their applications. This repository contains the exa…☆63Updated 7 months ago
- JAX bindings for the NVIDIA cuDecomp library☆36Updated last month
- Reference implementations of MLPerf™ HPC training benchmarks☆48Updated 3 months ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆21Updated last year
- scalable data movement in Exascale Supercomputers☆16Updated last month
- ☆55Updated last year
- Library for steering campaigns of simulations on supercomputers☆55Updated this week
- ☆21Updated 4 years ago
- ☆36Updated last month
- PyTorch examples for NERSC systems☆32Updated 7 months ago
- Cosmic Tagging Network for Neutrino Physics☆13Updated 11 months ago
- NPBench - A Benchmarking Suite for High-Performance NumPy☆81Updated 3 weeks ago
- ☆19Updated 6 years ago
- This is a repository with examples to run inference endpoints on various ALCF clusters☆20Updated last week
- CPU and GPU tutorial examples☆13Updated 2 months ago
- Guidelines on using Weights and Biases logging for deep learning applications on NERSC machines☆13Updated last year
- A hands-on introduction to tuning GPU kernels using Kernel Tuner https://github.com/KernelTuner/kernel_tuner/☆31Updated 2 months ago
- ☆123Updated this week
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆17Updated last month
- COCCL: Compression and precision co-aware collective communication library☆22Updated 2 months ago
- Lecture and hands-on material for Track 8- Machine Learning of Argonne Training Program on Extreme-Scale Computing☆37Updated 9 months ago
- How to use node-local MPI rank IDs to manually map MPI ranks to GPUs☆14Updated 5 years ago
- Numbast is a tool to build an automated pipeline that converts CUDA APIs into Numba bindings.☆47Updated last week
- PyTorch training at CSCS☆15Updated this week
- Dragon distributed runtime for HPC and AI applications and workflows☆72Updated 2 weeks ago