NERSC / sc24-dl-tutorialLinks
SC24 Deep Learning at Scale Tutorial Material
☆33Updated last year
Alternatives and similar repositories for sc24-dl-tutorial
Users that are interested in sc24-dl-tutorial are comparing it to the libraries listed below
Sorting:
- Material for the SC22 Deep Learning at Scale Tutorial☆41Updated 2 years ago
- The ALCF hosts a regular simulation, data, and learning workshop to help users scale their applications. This repository contains the exa…☆75Updated last month
- Collection of small examples for running on ALCF resources☆21Updated last month
- Guidelines on using Weights and Biases logging for deep learning applications on NERSC machines☆13Updated 2 years ago
- ALCF Computational Performance Workshop☆38Updated 3 years ago
- single-GPU to multi-GPU training of PyTorch apps at NERSC☆22Updated last year
- AI Training Series Material☆41Updated 4 months ago
- ☆49Updated 6 months ago
- SC23 Deep Learning at Scale Tutorial Material☆49Updated last year
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆22Updated 2 years ago
- Lecture and hands-on material for Track 8- Machine Learning of Argonne Training Program on Extreme-Scale Computing☆45Updated 5 months ago
- Pragmatic, Productive, and Portable Affinity for HPC☆51Updated 3 weeks ago
- Sources for the Oak Ridge Leadership Computing Facility User Documentation☆66Updated this week
- Reference implementations of MLPerf™ HPC training benchmarks☆49Updated 11 months ago
- ☆80Updated last month
- Benchmarks☆17Updated 9 months ago
- ☆21Updated 5 years ago
- PyTorch examples for NERSC systems☆34Updated last year
- A PyTorch native platform for training generative AI models☆14Updated 2 months ago
- ☆137Updated 3 months ago
- COCCL: Compression and precision co-aware collective communication library☆30Updated 10 months ago
- An Online Deep Learning Interface for HPC programs on NVIDIA GPUs☆183Updated this week
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data☆34Updated last week
- CPU and GPU tutorial examples☆13Updated 10 months ago
- Python bindings for OpenSHMEM☆25Updated 3 weeks ago
- Asynchronous I/O for HDF5☆24Updated 2 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆17Updated last week
- A library to abstract between different lossless and lossy compressors☆35Updated 3 months ago
- Analyze graph/hierarchical performance data using pandas dataframes☆118Updated 3 months ago
- JUPITER Benchmark Suite☆23Updated 6 months ago