mcbal / deep-implicit-attention
Implementation of deep implicit attention in PyTorch
☆65Updated 3 years ago
Alternatives and similar repositories for deep-implicit-attention:
Users that are interested in deep-implicit-attention are comparing it to the libraries listed below
- Very deep VAEs in JAX/Flax☆46Updated 3 years ago
- Transformers with doubly stochastic attention☆45Updated 2 years ago
- Code for the Thermodynamic Variational Objective☆26Updated 2 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆75Updated 9 months ago
- Riemannian Convex Potential Maps☆67Updated 2 years ago
- [ICML'21 Oral] Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding☆14Updated 3 years ago
- JAX exponential map normalising flows on sphere☆17Updated 4 years ago
- ☆49Updated 4 years ago
- Experiments for Meta-Learning Symmetries by Reparameterization☆56Updated 4 years ago
- Structured matrices for compressing neural networks☆66Updated last year
- Monotone operator equilibrium networks☆52Updated 4 years ago
- ICML 2020 Paper: Latent Variable Modelling with Hyperbolic Normalizing Flows☆53Updated 2 years ago
- Tensorflow implementation and notebooks for Implicit Maximum Likelihood Estimation☆67Updated 3 years ago
- Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch☆24Updated 4 years ago
- [NeurIPS 2020] Neural Manifold Ordinary Differential Equations (https://arxiv.org/abs/2006.10254)☆118Updated last year
- Implementation of Lie Transformer, Equivariant Self-Attention, in Pytorch☆91Updated 4 years ago
- Normalizing Flows in Jax☆107Updated 4 years ago
- ☆30Updated 4 years ago
- A minimal implementation of a VAE with BinConcrete (relaxed Bernoulli) latent distribution in TensorFlow.☆22Updated 5 years ago
- Pytorch implementation of the Power Spherical distribution☆74Updated 9 months ago
- [NeurIPS'19] Deep Equilibrium Models Jax Implementation☆39Updated 4 years ago
- A public repository for our paper, Rao-Blackwellized Stochastic Gradients for Discrete Distributions☆22Updated 6 years ago
- codebase for "A Theory of the Inductive Bias and Generalization of Kernel Regression and Wide Neural Networks"☆49Updated 2 years ago
- code for "Semi-Discrete Normalizing Flows through Differentiable Tessellation"☆26Updated 2 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement☆40Updated 5 years ago
- Codebase for Learning Invariances in Neural Networks☆95Updated 2 years ago
- Lie Algebra Convolutional Network implementation☆44Updated 3 years ago
- Stochastic Normalizing Flows☆76Updated 3 years ago
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- ☆53Updated 9 months ago