mcbal / deep-implicit-attentionLinks
Implementation of deep implicit attention in PyTorch
☆65Updated 4 years ago
Alternatives and similar repositories for deep-implicit-attention
Users that are interested in deep-implicit-attention are comparing it to the libraries listed below
Sorting:
- Tensorflow implementation and notebooks for Implicit Maximum Likelihood Estimation☆67Updated 3 years ago
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- Efficient Householder Transformation in PyTorch☆66Updated 4 years ago
- ☆50Updated 4 years ago
- General Invertible Transformations for Flow-based Generative Models☆18Updated 4 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆105Updated 4 years ago
- Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch☆25Updated 4 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆75Updated last year
- Experiments for Meta-Learning Symmetries by Reparameterization☆56Updated 4 years ago
- Very deep VAEs in JAX/Flax☆46Updated 4 years ago
- Pytorch implementation of the Power Spherical distribution☆74Updated last year
- Monotone operator equilibrium networks☆53Updated 5 years ago
- ☆54Updated last year
- JAX exponential map normalising flows on sphere☆17Updated 4 years ago
- ☆100Updated 3 years ago
- [ICML'21 Oral] Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding☆14Updated 4 years ago
- ICML 2020 Paper: Latent Variable Modelling with Hyperbolic Normalizing Flows☆54Updated 2 years ago
- Experiment code for "Randomized Automatic Differentiation"☆67Updated 5 years ago
- [NeurIPS 2020] Neural Manifold Ordinary Differential Equations (https://arxiv.org/abs/2006.10254)☆119Updated 2 years ago
- Riemannian Convex Potential Maps☆67Updated 2 years ago
- Stochastic Automatic Differentiation library for PyTorch.☆205Updated 11 months ago
- Humans understand novel sentences by composing meanings and roles of core language components. In contrast, neural network models for nat…☆27Updated 5 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement☆40Updated 5 years ago
- Convex potential flows☆83Updated 3 years ago
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)☆49Updated last month
- [NeurIPS'19] Deep Equilibrium Models Jax Implementation☆40Updated 4 years ago
- Official implementation of the paper "Topographic VAEs learn Equivariant Capsules"☆80Updated 3 years ago
- Code for "'Hey, that's not an ODE:' Faster ODE Adjoints via Seminorms" (ICML 2021)☆87Updated 2 years ago
- Differentiable Algorithms and Algorithmic Supervision.☆115Updated 2 years ago
- ☆63Updated last year