shyamsn97 / hyper-nn
Easy Hypernetworks in Pytorch and Jax
☆97 Updated 2 years ago
Alternatives and similar repositories for hyper-nn:
Users who are interested in hyper-nn are comparing it to the libraries listed below.
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper ☆80 Updated 3 years ago
- Transformers with doubly stochastic attention ☆44 Updated 2 years ago
- Contrastive Language-Image Pretraining ☆142 Updated 2 years ago
- Sequence Modeling with Structured State Spaces ☆62 Updated 2 years ago
- Gaussian-Bernoulli Restricted Boltzmann Machines ☆102 Updated 2 years ago
- LoRA for arbitrary JAX models and functions ☆135 Updated 11 months ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023) ☆122 Updated last year
- Pytorch-like dataloaders for JAX. ☆73 Updated 3 months ago
- ☆86 Updated 2 years ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch ☆97 Updated last year
- ☆33 Updated last year
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax). ☆106 Updated 2 years ago
- Code for the paper: Rotating Features for Object Discovery ☆50 Updated 6 months ago
- Fast Discounted Cumulative Sums in PyTorch ☆95 Updated 3 years ago
- Package for working with hypernetworks in PyTorch. ☆121 Updated last year
- Meta-learning inductive biases in the form of useful conserved quantities. ☆37 Updated 2 years ago
- A simple library for scaling up JAX programs ☆129 Updated 3 months ago
- Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper ☆104 Updated 3 years ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena ☆204 Updated last year
- ☆59 Updated 3 years ago
- Flow-matching algorithms in JAX ☆83 Updated 6 months ago
- Pretrained deep learning models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet, etc. ☆246 Updated last year
- Artificial Kuramoto Oscillatory Neurons ☆50 Updated this week
- ☆163 Updated 2 years ago
- ☆15 Updated 5 months ago
- [ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740) ☆157 Updated last year
- ☆37 Updated last year
- ☆22 Updated 3 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆82 Updated last year
- ☆35 Updated last year