google-research / growneuronLinks
☆55Updated last year
Alternatives and similar repositories for growneuron
Users that are interested in growneuron are comparing it to the libraries listed below
Sorting:
- Code for "Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?" [ICML 2023]☆36Updated last year
- NEVIS'22: Benchmarking the next generation of never-ending learners☆102Updated 2 years ago
- ☆51Updated last year
- ☆19Updated 3 years ago
- A centralized place for deep thinking code and experiments☆86Updated 2 years ago
- [NeurIPS'20] Code for the Paper Compositional Visual Generation and Inference with Energy Based Models☆46Updated 2 years ago
- ☆52Updated last year
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)☆50Updated 3 months ago
- [NeurIPS 2021] Code for Unsupervised Learning of Compositional Energy Concepts☆62Updated 3 years ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆101Updated 2 years ago
- Latest Weight Averaging (NeurIPS HITY 2022)☆31Updated 2 years ago
- ☆65Updated 3 years ago
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper☆81Updated 3 years ago
- Implementation of Discrete Key / Value Bottleneck, in Pytorch☆88Updated 2 years ago
- Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair☆49Updated last year
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆38Updated 2 years ago
- Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023]☆89Updated 2 years ago
- Easy Hypernetworks in Pytorch and Jax☆105Updated 2 years ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆53Updated 9 months ago
- Code repository for the paper "Meta-Learning via Classifier(-free) Diffusion Guidance"☆32Updated 2 years ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆88Updated last year
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆53Updated last year
- ☆41Updated 2 years ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆67Updated 2 years ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Updated 3 months ago
- [ICML'21] Improved Contrastive Divergence Training of Energy Based Models☆66Updated 3 years ago
- ☆38Updated last year
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆85Updated last year
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆81Updated 2 years ago
- Official code for the paper: "Metadata Archaeology"☆19Updated 2 years ago