lucas-maes / nano-simsiamLinks
Minimalistic, hackable PyTorch implementation of SimSiam in ~400 lines. Achieves good performance on ImageNet with ResNet50. Features distributed training, real-time KNN eval, and AMP. Perfect for research prototyping.
☆19Updated 7 months ago
Alternatives and similar repositories for nano-simsiam
Users that are interested in nano-simsiam are comparing it to the libraries listed below
Sorting:
- ☆31Updated 7 months ago
- An implementation of the Llama architecture, to instruct and delight☆21Updated 3 weeks ago
- Implementation of PSGD optimizer in JAX☆33Updated 5 months ago
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆30Updated 3 weeks ago
- Graph neural networks in JAX.☆67Updated last year
- ☆17Updated 10 months ago
- Jax/Flax rewrite of Karpathy's nanoGPT☆57Updated 2 years ago
- A simple hypernetwork implementation in jax using haiku.☆23Updated 2 years ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).☆112Updated 3 years ago
- ☆60Updated 3 years ago
- Implementation of Denoising Diffusion Probabilistic Models (DDPM) in JAX and Flax.☆20Updated last year
- AdaCat☆49Updated 2 years ago
- GPT implementation in Flax☆18Updated 3 years ago
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆34Updated last year
- Minimal but scalable implementation of large language models in JAX☆35Updated 7 months ago
- A JAX implementation of stochastic addition.☆14Updated 2 years ago
- Lightning-like training API for JAX with Flax☆41Updated 6 months ago
- ☆20Updated last year
- ☆54Updated 11 months ago
- Turn jitted jax functions back into python source code☆22Updated 6 months ago
- ☆104Updated 2 weeks ago
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆59Updated 3 years ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆83Updated last year
- ☆32Updated last year
- Open source code for EigenGame.☆30Updated 2 years ago
- ☆33Updated 2 years ago
- Tensor Parallelism with JAX + Shard Map☆11Updated last year
- flexible meta-learning in jax☆14Updated last year
- Scalable and Stable Parallelization of Nonlinear RNNS☆16Updated 5 months ago