facebookresearch / projUNNLinks
Fast training of unitary deep network layers from low-rank updates
☆28Updated 2 years ago
Alternatives and similar repositories for projUNN
Users that are interested in projUNN are comparing it to the libraries listed below
Sorting:
- Deep Networks Grok All the Time and Here is Why☆37Updated last year
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆37Updated 2 years ago
- ☆51Updated last year
- ☆31Updated 7 months ago
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- JAX implementation of "Fine-Tuning Language Models with Just Forward Passes"☆19Updated 2 years ago
- ☆51Updated last year
- ☆53Updated 8 months ago
- The Energy Transformer block, in JAX☆58Updated last year
- ☆37Updated 3 years ago
- [Oral; Neurips OPT2024 ] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers☆13Updated 3 months ago
- FID computation in Jax/Flax.☆27Updated 11 months ago
- ☆43Updated 3 weeks ago
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023)☆11Updated last year
- Blog post☆17Updated last year
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient"☆29Updated 4 years ago
- Open source code for EigenGame.☆30Updated 2 years ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆44Updated 2 years ago
- A simple hypernetwork implementation in jax using haiku.☆23Updated 2 years ago
- flexible meta-learning in jax☆14Updated last year
- ☆32Updated 8 months ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆75Updated 11 months ago
- A selection of neural network models ported from torchvision for JAX & Flax.☆44Updated 4 years ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).☆112Updated 3 years ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆83Updated last year
- ☆55Updated 10 months ago
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆32Updated 2 years ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆66Updated 9 months ago
- ☆16Updated 2 years ago
- ☆26Updated 2 years ago