rom1504 / gpu-tester
gpu tester detects broken and slow gpus in a cluster
☆68Updated 2 years ago
Alternatives and similar repositories for gpu-tester:
Users that are interested in gpu-tester are comparing it to the libraries listed below
- Simple python template☆40Updated 10 months ago
- ☆76Updated 8 months ago
- See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md☆23Updated 2 years ago
- Automatically take good care of your preemptible TPUs☆36Updated last year
- CLOOB training (JAX) and inference (JAX and PyTorch)☆70Updated 2 years ago
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI☆86Updated 3 years ago
- Train vision models using JAX and 🤗 transformers☆96Updated last month
- ☆95Updated 9 months ago
- A case study of efficient training of large language models using commodity hardware.☆68Updated 2 years ago
- ☆27Updated 4 years ago
- A library for unit scaling in PyTorch☆124Updated 3 months ago
- A JAX implementation of the continuous time formulation of Consistency Models☆84Updated last year
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence☆60Updated 3 years ago
- Amos optimizer with JEstimator lib.☆81Updated 10 months ago
- JAX Synergistic Memory Inspector☆171Updated 8 months ago
- WIP☆93Updated 7 months ago
- ☆51Updated last year
- Python Research Framework☆106Updated 2 years ago
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆73Updated 7 months ago
- Latent Diffusion Language Models☆68Updated last year
- HomebrewNLP in JAX flavour for maintable TPU-Training☆48Updated last year
- A small demonstration of using WebDataset with ImageNet and PyTorch Lightning☆74Updated last year
- ☆64Updated 3 years ago
- ☆20Updated last year
- M4 experiment logbook☆57Updated last year
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload☆126Updated 2 years ago
- Experiment of using Tangent to autodiff triton☆78Updated last year
- supporting pytorch FSDP for optimizers☆79Updated 3 months ago
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.☆56Updated 2 years ago
- JAX implementation ViT-VQGAN☆82Updated 2 years ago