BERT for Distributed PyTorch + AMP Training
☆12Mar 15, 2023Updated 3 years ago
Alternatives and similar repositories for BERT-PyTorch
Users that are interested in BERT-PyTorch are comparing it to the libraries listed below
Sorting:
- a tool to generate skeleton applications that mimic a real applications' parallel or distributed performance at a task level☆13Jan 11, 2017Updated 9 years ago
- ext_mpi_collectives☆11Apr 1, 2025Updated 11 months ago
- ☆11Aug 8, 2021Updated 4 years ago
- ☆19Jan 17, 2024Updated 2 years ago
- automatic GPU offload for scientific libraries☆16Mar 10, 2026Updated last week
- Tools to run and parse MKL verbose mode☆18Jun 28, 2022Updated 3 years ago
- SKFAC Preconditioner for MindSpore☆12Jul 2, 2021Updated 4 years ago
- Compiler toolchain to enable generation of high-level DSLs for geophysical fluid dynamics models☆29Mar 22, 2023Updated 3 years ago
- Get started with your NVIDIA Arm HPC Developers Kit!☆33Feb 16, 2023Updated 3 years ago
- Build and run container environment for LFRic☆10Jan 8, 2024Updated 2 years ago
- Working on this project as part of 3D-Vision course at ETH☆11Jul 20, 2020Updated 5 years ago
- Distributed Communication-Optimal Shuffle and Transpose Algorithm☆14Feb 20, 2026Updated last month
- Distributed Communication-Optimal LU-factorization Algorithm☆12Aug 1, 2021Updated 4 years ago
- Build and deploy stateful agents across federated resources☆92Updated this week
- Communication Avoiding Numerical Dense Matrix Computations☆11Dec 20, 2020Updated 5 years ago
- ALCF Computational Performance Workshop☆38Oct 7, 2022Updated 3 years ago
- Standalone mini-app of the ECMWF cloud microphysics parameterization☆11Feb 24, 2026Updated 3 weeks ago
- Upstream Kernel with Grace upstream pending patches for partners. Patches include any bug fixes during Grace production while they await …☆16Jul 11, 2023Updated 2 years ago
- https://eth-cscs.github.io/uenv/☆12Nov 18, 2024Updated last year
- The fftMPI library performs 2d/3d FFTs in parallel for grids distributed across MPI processes.☆14Jun 6, 2022Updated 3 years ago
- code for experiments in Grosse and Salakhutdinov, 2015.☆12Oct 9, 2016Updated 9 years ago
- Regularization, Neural Network Training Dynamics☆14Jan 13, 2020Updated 6 years ago
- A 2D Hydro code for benchmarking purpose☆20Feb 12, 2024Updated 2 years ago
- simple JAX-/NumPy-based implementations of NGD with exact/approximate Fisher Information Matrix both in parameter-space and function-spac…☆16Oct 21, 2020Updated 5 years ago
- A multigrid package in Julia: smoothed aggregation AMG + geometric multigrid.☆19Sep 30, 2025Updated 5 months ago
- CUDA 12.2 HMM demos☆20Jul 26, 2024Updated last year
- Multi-GPU training with TensorFlow on Piz Daint☆12Nov 23, 2021Updated 4 years ago
- ☆15Oct 15, 2025Updated 5 months ago
- The ECMWF wave model ecWAM☆17Mar 5, 2026Updated 2 weeks ago
- Library for steering campaigns of simulations on supercomputers☆61Jun 2, 2025Updated 9 months ago
- Proof of Concept: a C-callable GPU-enabled parallel 2-D heat diffusion solver written in Julia using CUDA, MPI and graphics☆24Nov 13, 2020Updated 5 years ago
- PyTorch training at CSCS☆20Jul 4, 2025Updated 8 months ago
- Continuum Dynamics Evaluation and Test Suite☆15Aug 29, 2017Updated 8 years ago
- Simple, lightweight transformers in Fortran☆17Nov 17, 2023Updated 2 years ago
- Code for paper "Beyond Closure Models: Learning Chaotic Systems via Physics-Informed Neural Operators".☆16Dec 24, 2025Updated 2 months ago
- SODECL is a library of ordinary differential equation (ODE) and stochastic differential equation (SDE) solvers in OpenCL.☆11Jul 4, 2020Updated 5 years ago
- The ALCF hosts a regular simulation, data, and learning workshop to help users scale their applications. This repository contains the exa…☆75Dec 17, 2025Updated 3 months ago
- PARADIS, a lightweight and flexible weather forecast model that tries to Keep It Simple.☆27Mar 4, 2026Updated 2 weeks ago
- Argonne Leadership Computing Facility OpenCL tutorial☆10Aug 22, 2025Updated 7 months ago