KurochkinAlexey / AntisymmetricRNNLinks
Python implementation of paper "AntisymmetricRNN: A Dynamical System View on Recurrent Neural Networks"
☆15Updated 6 years ago
Alternatives and similar repositories for AntisymmetricRNN
Users that are interested in AntisymmetricRNN are comparing it to the libraries listed below
Sorting:
- Official code for Coupled Oscillatory RNN (ICLR 2021, Oral)☆50Updated 4 years ago
- Code for ICML 2020 paper: Do RNN and LSTM have Long Memory?☆17Updated 4 years ago
- Supplementary code for the paper "Meta-Solver for Neural Ordinary Differential Equations" https://arxiv.org/abs/2103.08561☆25Updated 4 years ago
- Gradient Estimation with Discrete Stein Operators (NeurIPS 2022)☆17Updated last year
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)☆50Updated 3 months ago
- Implementation of deep implicit attention in PyTorch☆65Updated 4 years ago
- ☆12Updated 3 years ago
- Recursive Bayesian Networks☆11Updated 4 months ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Updated 3 years ago
- Blog post☆17Updated last year
- Bayesian Attention Modules☆35Updated 4 years ago
- Pytorch Implemetation for our NAACL2019 Paper "Riemannian Normalizing Flow on Variational Wasserstein Autoencoder for Text Modeling" http…☆63Updated 5 years ago
- Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021☆27Updated 4 years ago
- Code for "Training Deep Energy-Based Models with f-Divergence Minimization" ICML 2020☆36Updated 2 years ago
- Stochastic Gradient Langevin Dynamics for Bayesian learning☆34Updated 3 years ago
- Official code for UnICORNN (ICML 2021)☆28Updated 4 years ago
- [EMNLP'19] Summary for Transformer Understanding☆53Updated 5 years ago
- Monotone operator equilibrium networks☆53Updated 5 years ago
- Official Release of "Learning the Stein Discrepancy for Training and Evaluating Energy-Based Models without Sampling"☆49Updated 5 years ago
- Featurized Density Ratio Estimation☆20Updated 4 years ago
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10Updated 7 years ago
- ☆24Updated 5 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement☆40Updated 5 years ago
- ☆54Updated last year
- ☆37Updated 4 years ago
- General Invertible Transformations for Flow-based Generative Models☆18Updated 4 years ago
- ☆50Updated 4 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆105Updated 4 years ago
- Approximate Inference Turns Deep Networks into Gaussian Processes (dnn2gp)☆48Updated 5 years ago
- Computing the eigenvalues of Neural Tangent Kernel and Conjugate Kernel (aka NNGP kernel) over the boolean cube☆47Updated 6 years ago