SamsungSAILMontreal / ghn3

Code for "Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?" [ICML 2023]

☆36

Alternatives and similar repositories for ghn3

Users who are interested in ghn3 are comparing it to the repositories listed below.

gregorbachmann / scaling_mlps
☆51 Updated last year
yilundu / comet
[NeurIPS 2021] Code for Unsupervised Learning of Compositional Energy Concepts
☆62 Updated 2 years ago
sjunhongshen / DASH
☆23 Updated 2 years ago
google-deepmind / ssl_hsic
☆37 Updated 11 months ago
yilundu / ebm_compositionality
[NeurIPS'20] Code for the Paper Compositional Visual Generation and Inference with Energy Based Models
☆45 Updated 2 years ago
google-research / growneuron
☆55 Updated 11 months ago
lucidrains / discrete-key-value-bottleneck-pytorch
Implementation of Discrete Key / Value Bottleneck, in Pytorch
☆88 Updated 2 years ago
AvivNavon / DWSNets
Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023]
☆89 Updated last year
AllanYangZhou / nfn
NF-Layers for constructing neural functionals.
☆87 Updated last year
facebookresearch / ModelRatatouille
Recycling diverse models
☆45 Updated 2 years ago
Ping-C / optimizer
This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…
☆37 Updated 2 years ago
xu-ji / information-bottleneck
Deep Learning & Information Bottleneck
☆61 Updated 2 years ago
sjunhongshen / ORCA
Official implementation of ORCA proposed in the paper "Cross-Modal Fine-Tuning: Align then Refine"
☆71 Updated last year
KellerJordan / REPAIR
Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair
☆48 Updated last year
YannDubs / Invariant-Self-Supervised-Learning
Pytorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations"
☆41 Updated 2 years ago
RobertCsordas / linear_layer_as_attention
The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …
☆16 Updated last month
IDSIA / recurrent-fwp
Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)
☆49 Updated last month
yilundu / improved_contrastive_divergence
[ICML'21] Improved Contrastive Divergence Training of Energy Based Models
☆63 Updated 3 years ago
IdoAmos / not-from-scratch
☆32 Updated 8 months ago
smonsays / contrastive-meta-learning
Code accompanying the paper "A contrastive rule for meta-learning"
☆12 Updated 8 months ago
AhmedImtiazPrio / grok-adversarial
Deep Networks Grok All the Time and Here is Why
☆37 Updated last year
SamsungSAILMontreal / PAPA
Repository for the PopulAtion Parameter Averaging (PAPA) paper
☆26 Updated last year
Optimization-AI / SogCLR
Stochastic Optimization for Global Contrastive Learning without Large Mini-batches
☆20 Updated 2 years ago
aryol / inductive-scratchpad
Implementation for our paper "How Far Can Transformers Reason? The Locality Barrier and Inductive Scratchpad"
☆11 Updated last year
krafton-ai / mambaformer-icl
MambaFormer in-context learning experiments and implementation for https://arxiv.org/abs/2402.04248
☆55 Updated last year
shikaiqiu / compute-better-spent
☆53 Updated 9 months ago
mkofinas / neural-graphs
Official source code for "Graph Neural Networks for Learning Equivariant Representations of Neural Networks". In ICLR 2024 (oral).
☆81 Updated 11 months ago
SamsungSAILMontreal / nino
Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]
☆19 Updated last month
JeanKaddour / LAWA
Latest Weight Averaging (NeurIPS HITY 2022)
☆30 Updated 2 years ago
oripress / EntropyEnigma
Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"
☆53 Updated last year