conceptofmind / vit-flax
Implementation of numerous Vision Transformers in Google's JAX and Flax.
☆20Updated 2 years ago
Alternatives and similar repositories for vit-flax:
Users that are interested in vit-flax are comparing it to the libraries listed below
- FID computation in Jax/Flax.☆26Updated 6 months ago
- AdaCat☆49Updated 2 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆47Updated last year
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- An implementation of PSGD Kron second-order optimizer for PyTorch☆29Updated 3 weeks ago
- Local Attention - Flax module for Jax☆20Updated 3 years ago
- PyTorch interface for TrueGrad Optimizers☆41Updated last year
- Implementation of Vision Transformers in Flax☆18Updated 4 years ago
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation☆12Updated 3 years ago
- This is a port of Mistral-7B model in JAX☆30Updated 6 months ago
- Little article showing how to load pytorch's models with linear memory consumption☆34Updated 2 years ago
- Automatically take good care of your preemptible TPUs☆35Updated last year
- JAX implementation of Learning to learn by gradient descent by gradient descent☆26Updated 3 months ago
- Implementation of GateLoop Transformer in Pytorch and Jax☆87Updated 7 months ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Updated 3 years ago
- Train vision models using JAX and 🤗 transformers☆97Updated this week
- A GPT, made only of MLPs, in Jax☆57Updated 3 years ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆57Updated last year
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆22Updated last week
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆44Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆121Updated 9 months ago
- A simple library for scaling up JAX programs☆129Updated 2 months ago
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆86Updated last year
- EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc in JAX w/ Flax Linen and Objax☆127Updated last year
- Implementation of Denoising Diffusion Probabilistic Models (DDPM) in JAX and Flax.☆17Updated last year
- Texture mapping with variational auto-encoders☆40Updated 3 years ago
- An open source implementation of CLIP.☆32Updated 2 years ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆119Updated 6 months ago
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆73Updated 6 months ago
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆49Updated 6 months ago