Ongoing research training transformer language models at scale, including: BERT & GPT-2
☆17Feb 24, 2026Updated last week
Alternatives and similar repositories for Megatron-DeepSpeed
Users that are interested in Megatron-DeepSpeed are comparing it to the libraries listed below
Sorting:
- Cosmic Tagging Network for Neutrino Physics☆13Jun 26, 2024Updated last year
- ☆49Jul 17, 2025Updated 7 months ago
- Liquid Argon Computer Vision☆12Dec 4, 2025Updated 3 months ago
- VaniDL is an tool for analyzing I/O patterns and behavior with Deep Learning Applications.☆10Jul 8, 2022Updated 3 years ago
- A lattice QCD library.☆16Feb 10, 2026Updated 3 weeks ago
- Distributed Training of Bayesian Neural Networks at Scale☆11May 26, 2020Updated 5 years ago
- ☆21Nov 10, 2020Updated 5 years ago
- The ALCF hosts a regular simulation, data, and learning workshop to help users scale their applications. This repository contains the exa…☆75Dec 17, 2025Updated 2 months ago
- ☆20May 9, 2023Updated 2 years ago
- A tracing infrastructure for heterogeneous computing applications.☆40Updated this week
- All code related to scraping, parsing, cleaning, and processing data used by PEC☆17Nov 5, 2024Updated last year
- Materials Science Understanding Large Language Model☆24Feb 10, 2026Updated 3 weeks ago
- Example applications for the Department of Energy Computational Science Graduate Fellowship☆20Sep 11, 2025Updated 5 months ago
- Adaptive Parallel PDF Parsing and Resource Scaling Engine☆62Dec 17, 2025Updated 2 months ago
- Material for the SC22 Deep Learning at Scale Tutorial☆41Jul 14, 2023Updated 2 years ago
- ALCF Computational Performance Workshop☆38Oct 7, 2022Updated 3 years ago
- This is a repository with examples to run inference endpoints on various ALCF clusters☆27Feb 3, 2026Updated last month
- ☆44Updated this week
- Scaling RLLib for generic simulation environments on Theta☆20Feb 16, 2023Updated 3 years ago
- This is repository for a I/O benchmark which represents Scientific Deep Learning Workloads.☆23Dec 6, 2022Updated 3 years ago
- Train across all your devices, ezpz 🍋☆26Updated this week
- ☆78Updated this week
- ☆57Nov 18, 2025Updated 3 months ago
- Repository to host supporting information and code samples for Accelerated DFT☆37Apr 29, 2025Updated 10 months ago
- Run Llama 2 using MLX on macOS☆34Dec 18, 2023Updated 2 years ago
- PyTorch examples for NERSC systems☆34Oct 28, 2024Updated last year
- 详细双语注释版word2vec源码,well-annotated word2vec☆10Oct 3, 2021Updated 4 years ago
- ☆28Dec 3, 2025Updated 3 months ago
- A benchmark suite for measuring HDF5 performance.☆43Feb 24, 2026Updated last week
- AI Training Series Material☆40Oct 2, 2025Updated 5 months ago
- ☆11Jun 15, 2018Updated 7 years ago
- MATLAB code for Stein Point Markov Chain Monte Carlo.☆13Jul 3, 2019Updated 6 years ago
- Literate Python package development with Jupyter☆12Aug 18, 2025Updated 6 months ago
- A quarto extension for writing teaching practicals☆12Sep 21, 2025Updated 5 months ago
- SYCL* Templates for Linear Algebra (SYCL*TLA) - SYCL based CUTLASS implementation for Intel GPUs☆67Updated this week
- A Python MDSplus Thin Client Implementation☆17Sep 25, 2025Updated 5 months ago
- Modern normalizing flows in Python. Simple to use and easily extensible.☆12Feb 11, 2026Updated 3 weeks ago
- [Developmental] Quarto Extension to Enable Google Colaboratory Links with Quarto Documents☆15May 18, 2025Updated 9 months ago
- Pragmatic, Productive, and Portable Affinity for HPC☆51Updated this week