shyamsn97 / hyper-nn
Easy Hypernetworks in Pytorch and Jax
☆106, updated 2 years ago
Alternatives and similar repositories for hyper-nn
Users who are interested in hyper-nn are comparing it to the libraries listed below.
- ☆56, updated 2 years ago
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper ☆81, updated 4 years ago
- A collection of meta-learning algorithms in Jax ☆23, updated 3 years ago
- Pretrained deep learning models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet, etc. ☆263, updated 9 months ago
- ☆33, updated 2 years ago
- Gaussian-Bernoulli Restricted Boltzmann Machines ☆106, updated 3 years ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition… ☆188, updated last week
- Sequence Modeling with Structured State Spaces ☆67, updated 3 years ago
- ☆41, updated 3 years ago
- ☆35, updated last year
- ☆164, updated 2 years ago
- ☆192, updated 6 months ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax). ☆118, updated 3 years ago
- Contrastive Language-Image Pretraining ☆144, updated 3 years ago
- Parameter-Free Optimizers for Pytorch ☆130, updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆92, updated last year
- Fast Discounted Cumulative Sums in PyTorch ☆96, updated 4 years ago
- ☆36, updated last year
- Official repository for the paper "A Modern Self-Referential Weight Matrix That Learns to Modify Itself" (ICML 2022 & NeurIPS 2021 Deep R… ☆173, updated 6 months ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena ☆207, updated 2 years ago
- Non official implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023) ☆61, updated 3 months ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023) ☆127, updated 2 years ago
- Use Jax functions in Pytorch ☆259, updated 2 years ago
- LoRA for arbitrary JAX models and functions ☆143, updated last year
- Package for working with hypernetworks in PyTorch. ☆131, updated 2 years ago
- ☆56, updated last year
- Transformers with doubly stochastic attention ☆51, updated 3 years ago
- Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper ☆105, updated 4 years ago
- Implementing the Denoising Diffusion Probabilistic Model in Flax ☆156, updated 3 years ago
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally … ☆77, updated 2 years ago