shyamsn97 / hyper-nn
Easy Hypernetworks in PyTorch and JAX
☆98 Updated 2 years ago
Alternatives and similar repositories for hyper-nn:
Users who are interested in hyper-nn compare it to the libraries listed below.
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper ☆80 Updated 3 years ago
- LoRA for arbitrary JAX models and functions ☆135 Updated last year
- FID computation in Jax/Flax. ☆27 Updated 8 months ago
- ☆33 Updated 2 years ago
- ☆36 Updated last year
- [ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740) ☆158 Updated last year
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch ☆99 Updated 2 years ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax). ☆108 Updated 2 years ago
- Sequence Modeling with Structured State Spaces ☆63 Updated 2 years ago
- ☆112 Updated last month
- Contrastive Language-Image Pretraining ☆142 Updated 2 years ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs ☆36 Updated 2 years ago
- Pytorch-like dataloaders for JAX. ☆76 Updated 5 months ago
- Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper ☆104 Updated 3 years ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023) ☆122 Updated last year
- Pretrained deep learning models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet, etc. ☆249 Updated last year
- The 2D discrete wavelet transform for JAX ☆40 Updated 2 years ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena ☆204 Updated last year
- Transformers with doubly stochastic attention ☆45 Updated 2 years ago
- Meta Optimal Transport ☆100 Updated last year
- Gaussian-Bernoulli Restricted Boltzmann Machines ☆102 Updated 2 years ago
- Implementation of PSGD optimizer in JAX ☆28 Updated 2 months ago
- NF-Layers for constructing neural functionals. ☆82 Updated last year
- Automatically take good care of your preemptible TPUs ☆36 Updated last year
- Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload ☆126 Updated 2 years ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆83 Updated last year
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition… ☆169 Updated 3 months ago
- ☆88 Updated 2 years ago
- Running Jax in PyTorch Lightning ☆89 Updated 3 months ago
- Run PyTorch in JAX. 🤝 ☆231 Updated last month