gpauloski / BERT-PyTorchLinks
BERT for Distributed PyTorch + AMP Training
☆12Updated 2 years ago
Alternatives and similar repositories for BERT-PyTorch
Users that are interested in BERT-PyTorch are comparing it to the libraries listed below
Sorting:
- Fast SGEMM emulation on Tensor Cores☆17Updated 11 months ago
- ALCF Computational Performance Workshop☆38Updated 3 years ago
- ☆11Updated 10 months ago
- ☆17Updated 2 months ago
- This is the public repo for the MLPerf DeepCAM climate data segmentation proposal.☆16Updated 4 months ago
- CPU and GPU tutorial examples☆13Updated 10 months ago
- High-Performance Linpack Benchmark adopted version for GPU backend☆12Updated 3 years ago
- Acceleration codes for the Ozaki-scheme on integer matrix multiplication units.☆17Updated last month
- ☆12Updated 6 months ago
- ext_mpi_collectives☆11Updated 10 months ago
- Pragmatic, Productive, and Portable Affinity for HPC☆51Updated 3 weeks ago
- Benchmark implementation of CosmoFlow in TensorFlow Keras☆22Updated 2 years ago
- ☆49Updated 6 months ago
- automatic GPU offload for scientific libraries☆16Updated 3 weeks ago
- OpenVINO LLM Benchmark☆11Updated 2 years ago
- Memory Topology for GPUs☆17Updated 2 months ago
- ALCF Systems User Documentation☆29Updated this week
- Create and deploy virtual-experiments - co-processing computational workflows☆10Updated last week
- Reference implementation for the climate segmentation benchmark, based on the Exascale Deep Learning for Climate Analytics work☆10Updated 5 years ago
- Tools to run and parse MKL verbose mode☆18Updated 3 years ago
- Sparsity support for PyTorch☆38Updated 10 months ago
- A GPU performance prediction toolkit for CUDA programs☆18Updated 6 years ago
- Data and reproducibility scripts for the UoB-HPC Performance Portability studies☆18Updated last year
- This repository contains the results and code for the MLPerf™ Training v2.1 benchmark.☆15Updated 2 years ago
- OpenMP offload playground☆10Updated last year
- The ALCF hosts a regular simulation, data, and learning workshop to help users scale their applications. This repository contains the exa…☆75Updated last month
- This is the open source version of HPL-MXP. The code performance has been verified on Frontier☆18Updated 7 months ago
- Repository with examples and exercises for OLCF and AMD's HIP training series☆17Updated 2 years ago
- Argonne Leadership Computing Facility OpenCL tutorial☆10Updated 5 months ago
- ExaWorks SDK☆11Updated 2 years ago