google-research / growneuronLinks
☆55Updated 10 months ago
Alternatives and similar repositories for growneuron
Users that are interested in growneuron are comparing it to the libraries listed below
Sorting:
- Code for "Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?" [ICML 2023]☆34Updated 9 months ago
- Code repository for the paper "Meta-Learning via Classifier(-free) Diffusion Guidance"☆32Updated 2 years ago
- ☆63Updated 3 years ago
- Codebase for Mechanistic Mode Connectivity☆14Updated last year
- ☆51Updated last year
- Official PyTorch implementation of "Rapid Neural Architecture Search by Learning to Generate Graphs from Datasets" (ICLR 2021)☆64Updated 10 months ago
- ☆41Updated 2 years ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆80Updated last year
- Latest Weight Averaging (NeurIPS HITY 2022)☆30Updated 2 years ago
- ☆23Updated 2 years ago
- Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22)☆12Updated last year
- ☆51Updated 4 years ago
- NEVIS'22: Benchmarking the next generation of never-ending learners☆102Updated 2 years ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆37Updated 2 years ago
- [Oral; Neurips OPT2024 ] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers☆13Updated 3 months ago
- NF-Layers for constructing neural functionals.☆85Updated last year
- Official code for the paper: "Metadata Archaeology"☆19Updated 2 years ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆36Updated 2 years ago
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆32Updated 2 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated last year
- ☆16Updated 2 years ago
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆80Updated 10 months ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆54Updated last year
- ☆34Updated 9 months ago
- Recycling diverse models☆44Updated 2 years ago
- ☆53Updated 8 months ago
- [NeurIPS'20] Code for the Paper Compositional Visual Generation and Inference with Energy Based Models☆45Updated 2 years ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆51Updated 6 months ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆21Updated 2 years ago
- [NeurIPS 2021] Code for Unsupervised Learning of Compositional Energy Concepts☆61Updated 2 years ago