lucidrains / logavgexp-torchLinks
Implementation of LogAvgExp for Pytorch
☆36Updated 3 months ago
Alternatives and similar repositories for logavgexp-torch
Users that are interested in logavgexp-torch are comparing it to the libraries listed below
Sorting:
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI☆91Updated 3 years ago
- ☆33Updated 2 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆51Updated 3 years ago
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️☆55Updated 2 years ago
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne…☆116Updated 2 years ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆36Updated 3 years ago
- A convolution-free, transformer-only version of the CycleGAN framework☆33Updated 3 years ago
- Another attempt at a long-context / efficient transformer by me☆38Updated 3 years ago
- ☆47Updated 2 years ago
- A GPT, made only of MLPs, in Jax☆58Updated 4 years ago
- A new play-and-plug method of controlling an existing generative model with conditioning attributes and their compositions.☆73Updated 3 years ago
- Code for ICLR 2021 Paper, "Anytime Sampling for Autoregressive Models via Ordered Autoencoding"☆26Updated 2 years ago
- ☆41Updated 2 years ago
- PyTorch implementation of IRMAE https//arxiv.org/abs/2010.00679☆47Updated 3 years ago
- AdaCat☆49Updated 3 years ago
- An implementation of (Induced) Set Attention Block, from the Set Transformers paper☆60Updated 2 years ago
- Implementation of Metaformer, but in an autoregressive manner☆26Updated 3 years ago
- An open source implementation of CLIP.☆32Updated 2 years ago
- Very deep VAEs in JAX/Flax☆46Updated 4 years ago
- reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data"☆17Updated 10 months ago
- TF/Keras code for DiffStride, a pooling layer with learnable strides.☆124Updated 3 years ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆26Updated last year
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.☆56Updated 2 years ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'☆38Updated 3 years ago
- ☆21Updated 2 years ago
- A simple Transformer where the softmax has been replaced with normalization☆20Updated 4 years ago
- Implementation of Discrete Key / Value Bottleneck, in Pytorch☆88Updated 2 years ago
- Official repository for MaGNET, ICLR 2022☆24Updated 2 years ago
- [NeurIPS 2021] Why Spectral Normalization Stabilizes GANs: Analysis and Improvements☆40Updated 2 years ago
- Official repository for our ICLR 2021 paper Evaluating the Disentanglement of Deep Generative Models with Manifold Topology☆36Updated 4 years ago