AllanYangZhou / universal_neural_functional
☆53 Updated last year
Alternatives and similar repositories for universal_neural_functional
Users who are interested in universal_neural_functional are comparing it to the libraries listed below.
- Latent Program Network (from the "Searching Latent Program Spaces" paper) ☆99 Updated 3 weeks ago
- The Energy Transformer block, in JAX ☆58 Updated last year
- NF-Layers for constructing neural functionals. ☆90 Updated last year
- Deep Networks Grok All the Time and Here is Why ☆37 Updated last year
- ☆120 Updated 4 months ago
- ☆58 Updated last year
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine… ☆40 Updated 2 years ago
- Minimal but scalable implementation of large language models in JAX ☆35 Updated last month
- A simple library for scaling up JAX programs ☆144 Updated 11 months ago
- ☆34 Updated 11 months ago
- Implementation of PSGD optimizer in JAX ☆35 Updated 9 months ago
- ☆34 Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆88 Updated last year
- ☆53 Updated last year
- ICML 2022: Learning Iterative Reasoning through Energy Minimization ☆48 Updated 2 years ago
- Code for the paper "Function-Space Learning Rates" ☆23 Updated 4 months ago
- ☆73 Updated last year
- ☆32 Updated 7 months ago
- ☆194 Updated 2 months ago
- Universal Neurons in GPT2 Language Models ☆30 Updated last year
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905 ☆33 Updated 3 years ago
- seqax = sequence modeling + JAX ☆168 Updated 3 months ago
- LoRA for arbitrary JAX models and functions ☆141 Updated last year
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks" ☆59 Updated 3 years ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆168 Updated 4 months ago
- Scaling scaling laws with board games. ☆53 Updated 2 years ago
- ☆68 Updated 11 months ago
- Solving the Abstraction & Reasoning Corpus with DreamCoder ☆53 Updated last year
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations" ☆84 Updated 11 months ago
- Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023) ☆95 Updated 10 months ago