AllanYangZhou / universal_neural_functional
☆49 Updated last year
Alternatives and similar repositories for universal_neural_functional:
Users who are interested in universal_neural_functional are comparing it to the libraries listed below.
- Deep Networks Grok All the Time and Here is Why ☆34 Updated 11 months ago
- A centralized place for deep thinking code and experiments ☆83 Updated last year
- ☆52 Updated 7 months ago
- The Energy Transformer block, in JAX ☆57 Updated last year
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine… ☆36 Updated 2 years ago
- Scalable and Stable Parallelization of Nonlinear RNNs ☆15 Updated 3 months ago
- Implementation of PSGD optimizer in JAX ☆33 Updated 4 months ago
- ☆64 Updated 10 months ago
- Minimal but scalable implementation of large language models in JAX ☆34 Updated 6 months ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs ☆36 Updated 2 years ago
- ☆53 Updated last year
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization" ☆50 Updated 11 months ago
- ☆37 Updated last year
- ☆43 Updated last month
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025] ☆19 Updated last month
- ☆26 Updated last year
- Official source code for "Graph Neural Networks for Learning Equivariant Representations of Neural Networks". In ICLR 2024 (oral). ☆79 Updated 9 months ago
- Meta-learning inductive biases in the form of useful conserved quantities. ☆37 Updated 2 years ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model ☆50 Updated 5 months ago
- nanoGPT-like codebase for LLM training ☆94 Updated last month
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905 ☆31 Updated 3 years ago
- A MAD laboratory to improve AI architecture designs 🧪 ☆113 Updated 4 months ago
- 🧱 Modula software package ☆188 Updated last month
- ☆31 Updated last year
- Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair ☆47 Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆105 Updated this week
- ☆30 Updated 5 months ago
- Code for the paper "Function-Space Learning Rates" ☆19 Updated 2 weeks ago
- Sparse and discrete interpretability tool for neural networks ☆61 Updated last year
- ☆78 Updated 10 months ago