mcbal / deep-implicit-attentionLinks
Implementation of deep implicit attention in PyTorch
☆65Updated 4 years ago
Alternatives and similar repositories for deep-implicit-attention
Users that are interested in deep-implicit-attention are comparing it to the libraries listed below
Sorting:
- Tensorflow implementation and notebooks for Implicit Maximum Likelihood Estimation☆67Updated 3 years ago
- ☆50Updated 5 years ago
- Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch☆25Updated 5 years ago
- Meta-learning inductive biases in the form of useful conserved quantities.☆39Updated 3 years ago
- [ICML'21 Oral] Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding☆14Updated 4 years ago
- Very deep VAEs in JAX/Flax☆46Updated 4 years ago
- Efficient Householder Transformation in PyTorch☆69Updated 4 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆111Updated 4 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020.☆77Updated last year
- General Invertible Transformations for Flow-based Generative Models☆18Updated 5 years ago
- [NeurIPS 2020] Neural Manifold Ordinary Differential Equations (https://arxiv.org/abs/2006.10254)☆125Updated 2 years ago
- A minimal implementation of a VAE with BinConcrete (relaxed Bernoulli) latent distribution in TensorFlow.☆22Updated 6 years ago
- Experiment code for "Randomized Automatic Differentiation"☆67Updated 5 years ago
- Code for "'Hey, that's not an ODE:' Faster ODE Adjoints via Seminorms" (ICML 2021)☆89Updated 3 years ago
- Monotone operator equilibrium networks☆54Updated 5 years ago
- Pytorch implementation of the Power Spherical distribution☆75Updated last year
- Official implementation of the paper "Topographic VAEs learn Equivariant Capsules"☆81Updated 3 years ago
- ☆100Updated 4 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement☆40Updated 5 years ago
- Humans understand novel sentences by composing meanings and roles of core language components. In contrast, neural network models for nat…☆27Updated 5 years ago
- ICML 2020 Paper: Latent Variable Modelling with Hyperbolic Normalizing Flows☆54Updated 3 years ago
- Riemannian Convex Potential Maps☆67Updated 2 years ago
- Experiments for Meta-Learning Symmetries by Reparameterization☆58Updated 4 years ago
- Study on the applicability of Direct Feedback Alignment to neural view synthesis, recommender systems, geometric learning, and natural la…☆89Updated 3 years ago
- code for "Semi-Discrete Normalizing Flows through Differentiable Tessellation"☆26Updated 3 years ago
- ☆33Updated 2 years ago
- An implementation of (Induced) Set Attention Block, from the Set Transformers paper☆66Updated 3 years ago
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)☆51Updated 7 months ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆67Updated 3 years ago
- Neural Turing Machines in pytorch☆49Updated 4 years ago