shyamsn97 / hyper-nnLinks
Easy Hypernetworks in Pytorch and Jax
☆105Updated 2 years ago
Alternatives and similar repositories for hyper-nn
Users that are interested in hyper-nn are comparing it to the libraries listed below
Sorting:
- ☆54Updated 2 years ago
- Gaussian-Bernoulli Restricted Boltzmann Machines☆105Updated 2 years ago
- Sequence Modeling with Structured State Spaces☆66Updated 3 years ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆206Updated 2 years ago
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper☆81Updated 3 years ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆101Updated 2 years ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆127Updated 2 years ago
- LoRA for arbitrary JAX models and functions☆141Updated last year
- ☆33Updated 2 years ago
- Implementing the Denoising Diffusion Probabilistic Model in Flax☆151Updated 2 years ago
- Rational Activation Functions - Replacing Padé Activation Units☆101Updated 7 months ago
- Pretrained deep learning models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet, etc.☆260Updated 7 months ago
- A collection of meta-learning algorithms in Jax☆23Updated 3 years ago
- Running Jax in PyTorch Lightning☆112Updated 10 months ago
- ☆41Updated 3 years ago
- Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper☆105Updated 3 years ago
- Pedagogical codebase for a simplified score-based generative model design, with training loop☆40Updated 4 years ago
- Contrastive Language-Image Pretraining☆143Updated 3 years ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).☆114Updated 3 years ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆88Updated last year
- ☆89Updated 3 years ago
- ☆164Updated 2 years ago
- A minimalist implementation of score-based diffusion model☆129Updated 4 years ago
- Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch☆91Updated 2 years ago
- Non official implementation of the Linear Recurrent Unit (LRU, Orvieto et al. 2023)☆58Updated last month
- Parameter-Free Optimizers for Pytorch☆131Updated last year
- ☆120Updated 4 months ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated 9 months ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆35Updated 3 years ago
- ☆192Updated 4 months ago