shyamsn97 / hyper-nn
Easy Hypernetworks in Pytorch and Jax
☆103 Updated 2 years ago
Alternatives and similar repositories for hyper-nn
Users who are interested in hyper-nn are comparing it to the libraries listed below.
- Pretrained deep learning models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet, etc. ☆254 Updated 3 months ago
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper ☆81 Updated 3 years ago
- Implementing the Denoising Diffusion Probabilistic Model in Flax ☆150 Updated 2 years ago
- LoRA for arbitrary JAX models and functions ☆140 Updated last year
- Sequence Modeling with Structured State Spaces ☆65 Updated 2 years ago
- Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper ☆105 Updated 3 years ago
- Gaussian-Bernoulli Restricted Boltzmann Machines ☆104 Updated 2 years ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax). ☆112 Updated 3 years ago
- Contrastive Language-Image Pretraining ☆143 Updated 2 years ago
- ☆53 Updated 2 years ago
- ☆33 Updated 2 years ago
- ☆90 Updated 2 years ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena ☆204 Updated last year
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition… ☆179 Updated last month
- A minimalist implementation of score-based diffusion model ☆127 Updated 3 years ago
- ☆192 Updated 2 weeks ago
- A collection of meta-learning algorithms in Jax ☆23 Updated 2 years ago
- ☆110 Updated last month
- ☆31 Updated 7 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆84 Updated last year
- FID computation in Jax/Flax. ☆28 Updated 11 months ago
- Fast Discounted Cumulative Sums in PyTorch ☆96 Updated 3 years ago
- A simple, easy-to-understand library for diffusion models using Flax and Jax. Includes detailed notebooks on DDPM, DDIM, and EDM with sim… ☆29 Updated 2 months ago
- Transformers with doubly stochastic attention ☆46 Updated 2 years ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023) ☆124 Updated last year
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch ☆101 Updated 2 years ago
- Pytorch-like dataloaders for JAX. ☆90 Updated last month
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods. ☆20 Updated last year
- Official Pytorch and JAX implementation of "Efficient-VDVAE: Less is more" ☆196 Updated 2 years ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch ☆25 Updated 5 months ago