lucas-maes / nano-simsiam
Minimalistic, hackable PyTorch implementation of SimSiam in ~400 lines. Achieves good performance on ImageNet with ResNet50. Features distributed training, real-time KNN eval, and AMP. Perfect for research prototyping.
☆19 Updated 5 months ago
Alternatives and similar repositories for nano-simsiam
Users who are interested in nano-simsiam are comparing it to the libraries listed below.
- ☆60 Updated 3 years ago
- ☆40 Updated last year
- An implementation of the Llama architecture, to instruct and delight ☆21 Updated 4 months ago
- Implementation of PSGD optimizer in JAX ☆33 Updated 4 months ago
- ☆17 Updated 8 months ago
- ☆32 Updated 7 months ago
- ☆30 Updated 5 months ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ) ☆29 Updated 11 months ago
- ☆53 Updated 7 months ago
- Transformer with Mu-Parameterization, implemented in JAX/Flax. Supports FSDP on TPU pods. ☆30 Updated this week
- A simple hypernetwork implementation in JAX using Haiku. ☆23 Updated 2 years ago
- GPT implementation in Flax ☆18 Updated 3 years ago
- Meta-learning inductive biases in the form of useful conserved quantities. ☆37 Updated 2 years ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax). ☆109 Updated 2 years ago
- Parallel Associative Scan for Language Models ☆18 Updated last year
- Minimal but scalable implementation of large language models in JAX ☆34 Updated 6 months ago
- Running JAX in PyTorch Lightning ☆100 Updated 5 months ago
- Simple tools to mix and match PyTorch and JAX - Get the best of both worlds! ☆30 Updated 2 weeks ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single-machine microbatches, in PyTorch ☆25 Updated 3 months ago
- ICML 2022: Learning Iterative Reasoning through Energy Minimization ☆46 Updated 2 years ago
- PyTorch-like dataloaders for JAX. ☆81 Updated 2 weeks ago
- JAX implementation of "Fine-Tuning Language Models with Just Forward Passes" ☆19 Updated last year
- ☆31 Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆83 Updated last year
- Graph neural networks in JAX. ☆67 Updated 10 months ago