AllanYangZhou / universal_neural_functionalLinks
☆56Updated last year
Alternatives and similar repositories for universal_neural_functional
Users that are interested in universal_neural_functional are comparing it to the libraries listed below
Sorting:
- ☆122Updated 7 months ago
- The Energy Transformer block, in JAX☆63Updated 2 years ago
- Deep Networks Grok All the Time and Here is Why☆38Updated last year
- Latent Program Network (from the "Searching Latent Program Spaces" paper)☆107Updated last month
- NF-Layers for constructing neural functionals.☆93Updated 2 years ago
- A centralized place for deep thinking code and experiments☆89Updated 2 years ago
- ☆35Updated last year
- ☆82Updated last year
- ICML 2022: Learning Iterative Reasoning through Energy Minimization☆48Updated 2 years ago
- ☆62Updated last year
- A simple library for scaling up JAX programs☆144Updated 2 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆92Updated last year
- Implementation of PSGD optimizer in JAX☆35Updated last year
- Sparse and discrete interpretability tool for neural networks☆64Updated last year
- Scalable and Stable Parallelization of Nonlinear RNNS☆28Updated 2 months ago
- ☆233Updated 11 months ago
- ☆53Updated last year
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆40Updated 2 years ago
- Open source code for EigenGame.☆34Updated 2 years ago
- Code for minimum-entropy coupling.☆32Updated last week
- LoRA for arbitrary JAX models and functions☆143Updated last year
- ☆31Updated 9 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆87Updated last year
- ☆35Updated last year
- Parallelizing non-linear sequential models over the sequence length☆56Updated 6 months ago
- Supporting code for the blog post on modular manifolds.☆111Updated 3 months ago
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆81Updated 3 years ago
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆60Updated 3 years ago
- ☆41Updated 3 years ago
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905☆33Updated 4 years ago