EIFY / mup-vit
Everything you need to reproduce "Better plain ViT baselines for ImageNet-1k" in PyTorch, and more
☆9Updated this week
Alternatives and similar repositories for mup-vit
Users that are interested in mup-vit are comparing it to the libraries listed below
Sorting:
- Another attempt at a long-context / efficient transformer by me☆38Updated 3 years ago
- Easy Hypernetworks in Pytorch and Jax☆100Updated 2 years ago
- CLOOB training (JAX) and inference (JAX and PyTorch)☆71Updated 3 years ago
- FID computation in Jax/Flax.☆27Updated 10 months ago
- Python library for argument and configuration management☆54Updated 2 years ago
- PyTorch interface for TrueGrad Optimizers☆41Updated last year
- A case study of efficient training of large language models using commodity hardware.☆69Updated 2 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆50Updated last year
- Implementation of Feedback Transformer in Pytorch☆106Updated 4 years ago
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.☆56Updated 2 years ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'☆38Updated 3 years ago
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper☆81Updated 3 years ago
- Pedagogical codebase for a simplified score-based generative model design, with training loop☆40Updated 3 years ago
- Refactoring dalle-pytorch and taming-transformers for TPU VM☆60Updated 3 years ago
- These papers will provide unique insightful concepts that will broaden your perspective on neural networks and deep learning☆48Updated last year
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆77Updated 9 months ago
- ☆51Updated last year
- gpu tester detects broken and slow gpus in a cluster☆70Updated 2 years ago
- ☆33Updated 8 months ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆36Updated 2 years ago
- Implementation of LogAvgExp for Pytorch☆36Updated last month
- MaskedTensors for PyTorch☆38Updated 2 years ago
- ☆89Updated 2 years ago
- Simple python template☆41Updated last year
- Train vision models using JAX and 🤗 transformers☆96Updated last month
- Mobile Viewer for W&B, built on top of Flutter.☆34Updated last year
- ☆22Updated 11 months ago
- An open source implementation of CLIP.☆32Updated 2 years ago
- supporting pytorch FSDP for optimizers☆80Updated 5 months ago
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload☆127Updated 2 years ago