shyamsn97 / hyper-nnLinks
Easy Hypernetworks in Pytorch and Jax
☆105Updated 2 years ago
Alternatives and similar repositories for hyper-nn
Users that are interested in hyper-nn are comparing it to the libraries listed below
Sorting:
- Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper☆81Updated 3 years ago
- ☆54Updated 2 years ago
- Contrastive Language-Image Pretraining☆144Updated 3 years ago
- Pretrained deep learning models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet, etc.☆259Updated 6 months ago
- LoRA for arbitrary JAX models and functions☆142Updated last year
- A collection of meta-learning algorithms in Jax☆23Updated 3 years ago
- Package for working with hypernetworks in PyTorch.☆129Updated 2 years ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…☆183Updated last week
- ☆192Updated 3 months ago
- Gaussian-Bernoulli Restricted Boltzmann Machines☆106Updated 2 years ago
- FID computation in Jax/Flax.☆28Updated last year
- Implementing the Denoising Diffusion Probabilistic Model in Flax☆151Updated 2 years ago
- Implementations and checkpoints for ResNet, Wide ResNet, ResNeXt, ResNet-D, and ResNeSt in JAX (Flax).☆112Updated 3 years ago
- ☆91Updated 3 years ago
- ☆33Updated 2 years ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆89Updated last year
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated 8 months ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆127Updated last year
- ☆33Updated 10 months ago
- Sequence Modeling with Structured State Spaces☆66Updated 3 years ago
- Unofficial JAX implementations of deep learning research papers☆156Updated 3 years ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆101Updated 2 years ago
- ☆41Updated 3 years ago
- Fast Discounted Cumulative Sums in PyTorch☆96Updated 4 years ago
- Rational Activation Functions - Replacing Padé Activation Units☆99Updated 6 months ago
- Parameter-Free Optimizers for Pytorch☆130Updated last year
- ☆164Updated 2 years ago
- Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper☆105Updated 3 years ago
- Running Jax in PyTorch Lightning☆113Updated 9 months ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆207Updated 2 years ago