linusericsson / einspace
Official code for our NeurIPS 2024 paper "einspace: Searching for Neural Architectures from Fundamental Operations"
☆28 · Updated 5 months ago
Alternatives and similar repositories for einspace:
Users who are interested in einspace are comparing it to the libraries listed below:
- [ICLR 2024] Quick-Tune: Quickly Learning Which Pretrained Model to Finetune and How ☆32 · Updated 5 months ago
- ☆13 · Updated 2 years ago
- Lightweight torch implementation of RigL, a sparse-to-sparse optimizer. ☆56 · Updated 3 years ago
- Code for "Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?" [ICML 2023] ☆32 · Updated 7 months ago
- Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023] ☆89 · Updated last year
- ☆22 · Updated 2 years ago
- ☆146 · Updated last year
- ☆52 · Updated 6 months ago
- Deep Networks Grok All the Time and Here is Why ☆34 · Updated 11 months ago
- HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models ☆18 · Updated 4 months ago
- Official source code for "Graph Neural Networks for Learning Equivariant Representations of Neural Networks". In ICLR 2024 (oral). ☆79 · Updated 8 months ago
- ☆11 · Updated 2 years ago
- Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair ☆47 · Updated last year
- ☆49 · Updated last year
- Model Zoos published at the NeurIPS 2022 Dataset & Benchmark track: "Model Zoos: A Dataset of Diverse Populations of Neural Network Model… ☆55 · Updated last year
- ☆17 · Updated 7 months ago
- ☆47 · Updated last year
- NAS + Cascades | Best Paper @ GECCO 2022 ☆16 · Updated last year
- ☆51 · Updated 10 months ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024] ☆65 · Updated 6 months ago
- Deep Learning & Information Bottleneck ☆60 · Updated last year
- Meta-Album meta-dataset for few-shot image classification ☆24 · Updated 2 years ago
- Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023] ☆28 · Updated last year
- [CVPR 2024] Friendly Sharpness-Aware Minimization ☆33 · Updated 5 months ago
- Modern Fixed Point Systems using PyTorch ☆89 · Updated last year
- Parallelizing non-linear sequential models over the sequence length ☆51 · Updated 3 months ago
- ☆61 · Updated 3 years ago
- ☆35 · Updated 2 years ago
- Fast training of unitary deep network layers from low-rank updates ☆28 · Updated 2 years ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025] ☆18 · Updated last month