Johswald / awesome-hypernetworks
☆60Updated 3 years ago
Alternatives and similar repositories for awesome-hypernetworks:
Users who are interested in awesome-hypernetworks are comparing it to the libraries listed below.
- Package for working with hypernetworks in PyTorch.☆122Updated last year
- Continual Learning with Hypernetworks. A continual learning approach that has the flexibility to learn a dedicated set of parameters, fin…☆163Updated 2 years ago
- Relative representations can be leveraged to enable solving tasks regarding "latent communication": from zero-shot model stitching to lat…☆56Updated last year
- Code for "SAM as an Optimal Relaxation of Bayes", ICLR 2023.☆25Updated last year
- ☆54Updated 8 months ago
- Code for "Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?" [ICML 2023]☆31Updated 7 months ago
- Official implementation of Transformer Neural Processes☆72Updated 2 years ago
- ☆65Updated 3 months ago
- ☆28Updated 8 months ago
- NF-Layers for constructing neural functionals.☆82Updated last year
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling"☆82Updated 2 years ago
- Code for GFlowNet-EM, a novel algorithm for fitting latent variable models with compositional latents and an intractable true posterior.☆41Updated last year
- ☆38Updated 4 months ago
- Deep Learning & Information Bottleneck☆58Updated last year
- [ICML'21] Improved Contrastive Divergence Training of Energy Based Models☆62Updated 2 years ago
- Parallelizing non-linear sequential models over the sequence length☆51Updated 2 months ago
- Beyond Straight-Through☆94Updated last year
- Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023]☆87Updated last year
- Simple CIFAR10 ResNet example with JAX.☆23Updated 3 years ago
- Code to reproduce the results for Compositional Attention☆60Updated 2 years ago
- Codebase for Mechanistic Mode Connectivity☆13Updated last year
- PyTorch code for "Improving Self-Supervised Learning by Characterizing Idealized Representations"☆40Updated 2 years ago
- Source code of "What can linearized neural networks actually say about generalization?"☆20Updated 3 years ago
- Continual learning of task-specific approximations of the parameter posterior distribution via a shared hypernetwork.☆16Updated 5 months ago
- Transformers with doubly stochastic attention☆45Updated 2 years ago
- Code to implement the AND-mask and geometric mean to do gradient based optimization, from the paper "Learning explanations that are hard …☆39Updated 4 years ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆50Updated 4 months ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆79Updated last year
- Repo to accompany paper on Meta Learning with Implicit Gradients (NeurIPS 2019)☆57Updated 5 years ago
- ICML 2022: Learning Iterative Reasoning through Energy Minimization☆45Updated 2 years ago