rom1504 / gpu-tester
gpu-tester detects broken and slow GPUs in a cluster
☆67 Updated last year
Related projects
Alternatives and complementary repositories for gpu-tester
- Simple Python template ☆40 Updated 6 months ago
- See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md ☆23 Updated last year
- ☆73 Updated 4 months ago
- CLOOB training (JAX) and inference (JAX and PyTorch) ☆70 Updated 2 years ago
- Automatically take good care of your preemptible TPUs ☆32 Updated last year
- ☆57 Updated 2 years ago
- HomebrewNLP in JAX flavour for maintainable TPU-Training ☆46 Updated 10 months ago
- Scalable and Performant Data Loading ☆68 Updated this week
- A case study of efficient training of large language models using commodity hardware. ☆68 Updated 2 years ago
- Efficient optimizers ☆87 Updated this week
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload ☆124 Updated 2 years ago
- Contrastive Language-Image Pretraining ☆143 Updated 2 years ago
- ☆77 Updated 5 months ago
- JAX implementation of ViT-VQGAN ☆77 Updated 2 years ago
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizers ☆59 Updated 4 months ago
- Another attempt at a long-context / efficient transformer by me ☆37 Updated 2 years ago
- Named tensors with first-class dimensions for PyTorch ☆322 Updated last year
- Multidimensional indexing for tensors ☆113 Updated last year
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets' ☆38 Updated 2 years ago
- M4 experiment logbook ☆56 Updated last year
- Implementation of Hourglass Transformer, in PyTorch, from Google and OpenAI ☆84 Updated 2 years ago
- MaskedTensors for PyTorch ☆38 Updated 2 years ago
- A library for unit scaling in PyTorch ☆105 Updated 2 weeks ago
- Latent Diffusion Language Models ☆67 Updated last year
- ☆64 Updated 3 years ago
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only; don't use it for Adam ☆68 Updated 3 months ago
- Experiment of using Tangent to autodiff Triton ☆72 Updated 10 months ago
- PyTorch interface for TrueGrad Optimizers ☆39 Updated last year