gpauloski / BERT-PyTorch
BERT for Distributed PyTorch + AMP Training
☆12Updated last year
Related projects ⓘ
Alternatives and complementary repositories for BERT-PyTorch
- Some improvements on Adam☆28Updated 4 years ago
- This repository contains the results and code for the MLPerf™ Training v0.7 benchmark.☆56Updated last year
- Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch☆12Updated 2 years ago
- A logging tool for deep learning.☆52Updated 2 years ago
- CUDA 12.2 HMM demos☆17Updated 3 months ago
- ☆36Updated last year
- ☆14Updated 2 years ago
- Distributed preprocessing and data loading for language datasets☆39Updated 7 months ago
- LLM-Inference-Bench☆11Updated last week
- SParse AcceleRation on Tensor Architecture☆17Updated last month
- A pytorch realization of adafactor (https://arxiv.org/pdf/1804.04235.pdf )☆24Updated 5 years ago
- Performance benchmarking with ColossalAI☆39Updated 2 years ago
- Sparsity support for PyTorch☆31Updated this week
- ☆16Updated 5 years ago
- A GPU performance prediction toolkit for CUDA programs☆16Updated 5 years ago
- Material for the SC21 Deep Learning at Scale Tutorial☆25Updated last year
- Benchmarking PyTorch 2.0 different models☆21Updated last year
- Distributed Communication-Optimal LU-factorization Algorithm☆12Updated 3 years ago
- High-Performance Linpack Benchmark adopted version for GPU backend☆11Updated 2 years ago
- MLPerf™ logging library☆30Updated this week
- ☆13Updated 3 years ago
- ☆22Updated 5 years ago
- Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)☆15Updated 3 years ago
- ☆11Updated 3 years ago
- A Python library transfers PyTorch tensors between CPU and NVMe☆98Updated this week
- Personal collection of references for high performance mixed precision training.☆41Updated 5 years ago
- ☆55Updated 6 months ago
- Code repo for "Transformer on a Diet" paper☆31Updated 4 years ago
- A parallel framework for training deep neural networks☆45Updated 3 weeks ago