shyamsn97 / hyper-nn
Easy Hypernetworks in Pytorch and Jax
☆100Updated 2 years ago
Alternatives and similar repositories for hyper-nn:
Users that are interested in hyper-nn are comparing it to the libraries listed below
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆123Updated last year
- ☆33Updated 2 years ago
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper☆80Updated 3 years ago
- ☆50Updated 2 years ago
- NF-Layers for constructing neural functionals.☆82Updated last year
- Transformers with doubly stochastic attention☆45Updated 2 years ago
- Sequence Modeling with Structured State Spaces☆63Updated 2 years ago
- Implementation of PSGD optimizer in JAX☆30Updated 3 months ago
- [ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)☆158Updated last year
- Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper☆104Updated 3 years ago
- AdaCat☆49Updated 2 years ago
- Contrastive Language-Image Pretraining☆142Updated 2 years ago
- The Official PyTorch Implementation of "VAEBM: A Symbiosis between Variational Autoencoders and Energy-based Models" (ICLR 2021 spotlight…☆55Updated 2 years ago
- Meta Optimal Transport☆102Updated last year
- ☆36Updated last year
- Code for ICLR 2021 Paper, "Anytime Sampling for Autoregressive Models via Ordered Autoencoding"☆26Updated last year
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆87Updated 2 years ago
- ☆37Updated last year
- Code for our paper "Generative Flow Networks for Discrete Probabilistic Modeling"☆82Updated 2 years ago
- ☆49Updated last year
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆203Updated last year
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆99Updated 2 years ago
- Code for the paper: Rotating Features for Object Discovery☆50Updated 8 months ago
- Gradient-based constrained optimization for JAX☆30Updated 2 years ago
- Minimal JAX/Flax port of `lpips` supporting `vgg16`, with pre-trained weights stored in the 🤗 Hugging Face hub.☆15Updated 2 years ago
- ☆173Updated 4 months ago
- Fast Discounted Cumulative Sums in PyTorch☆95Updated 3 years ago
- Unofficial JAX implementations of deep learning research papers☆154Updated 2 years ago
- ☆38Updated 2 years ago
- LoRA for arbitrary JAX models and functions☆136Updated last year