jxbz / nero
👑 Pytorch code for the Nero optimiser.
☆20 Updated 2 years ago
Alternatives and similar repositories for nero:
Users who are interested in nero are comparing it to the libraries listed below.
- A GPT, made only of MLPs, in Jax ☆57 Updated 3 years ago
- ☆68 Updated last year
- A selection of neural network models ported from torchvision for JAX & Flax. ☆44 Updated 4 years ago
- Meta-learning inductive biases in the form of useful conserved quantities. ☆37 Updated 2 years ago
- Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch ☆24 Updated 4 years ago
- AdaCat ☆49 Updated 2 years ago
- Toy implementations of some popular ML optimizers using Python/JAX ☆44 Updated 3 years ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax). ☆107 Updated 2 years ago
- High performance pytorch modules ☆18 Updated 2 years ago
- Dive into Jax, Flax, XLA and C++ ☆31 Updated 4 years ago
- A collection of optimizers, some arcane others well known, for Flax. ☆29 Updated 3 years ago
- Official repository for our ICLR 2021 paper Evaluating the Disentanglement of Deep Generative Models with Manifold Topology ☆35 Updated 3 years ago
- [NeurIPS'19] Deep Equilibrium Models Jax Implementation ☆39 Updated 4 years ago
- Very deep VAEs in JAX/Flax ☆46 Updated 3 years ago
- A small library for creating and manipulating custom JAX Pytree classes ☆56 Updated 2 years ago
- Image augmentation library for Jax ☆37 Updated 10 months ago
- A simple Transformer where the softmax has been replaced with normalization ☆19 Updated 4 years ago
- ☆39 Updated 2 years ago
- ☆30 Updated 3 years ago
- Texture mapping with variational auto-encoders ☆40 Updated 3 years ago
- code for "Semi-Discrete Normalizing Flows through Differentiable Tessellation" ☆26 Updated 2 years ago
- A framework for implementing equivariant DL ☆10 Updated 3 years ago
- 👩 Pytorch and Jax code for the Madam optimiser. ☆51 Updated 4 years ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets' ☆38 Updated 3 years ago
- JAX implementation of Learning to learn by gradient descent by gradient descent ☆27 Updated 4 months ago
- ☆24 Updated 6 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing ☆48 Updated 3 years ago
- A simple hypernetwork implementation in jax using haiku. ☆23 Updated 2 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation. ☆49 Updated 3 years ago
- ☆33 Updated 4 years ago