Gladys-Zhao / mRNN-mLSTMLinks
Code for ICML 2020 paper: Do RNN and LSTM have Long Memory?
☆17Updated 5 years ago
Alternatives and similar repositories for mRNN-mLSTM
Users that are interested in mRNN-mLSTM are comparing it to the libraries listed below
Sorting:
- Python implementation of paper "AntisymmetricRNN: A Dynamical System View on Recurrent Neural Networks"☆15Updated 6 years ago
- [EMNLP'19] Summary for Transformer Understanding☆53Updated 6 years ago
- Implementation of Mogrifier LSTM in PyTorch☆34Updated 5 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Updated 4 years ago
- ☆40Updated 7 years ago
- Curse-of-memory phenomenon of RNNs in sequence modelling☆19Updated 8 months ago
- A quick walk-through of the innards of LSTMs and a naive implementation of the Mogrifier LSTM paper in PyTorch☆78Updated 5 years ago
- ☆33Updated 4 years ago
- Repository for Beyond Pinball Loss: Quantile Methods for Calibrated Uncertainty Quantification (NeurIPS 2024)☆44Updated last year
- ☆24Updated last year
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)☆51Updated 7 months ago
- ☆12Updated 3 years ago
- [ICLR 2022] "Bayesian Modeling and Uncertainty Quantification for Learning to Optimize: What, Why, and How" by Yuning You, Yue Cao, Tianl…☆14Updated 3 years ago
- An adaptive training algorithm for residual network☆17Updated 5 years ago
- Uncertainty on Asynchronous Time Event Prediction (Spotlight, Neurips 2019)☆19Updated 5 years ago
- Code for Reparameterizable Subset Sampling via Continuous Relaxations, IJCAI 2019.☆57Updated 2 years ago
- Gradient Estimation with Discrete Stein Operators (NeurIPS 2022)☆17Updated 2 years ago
- ☆28Updated 6 years ago
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10Updated 7 years ago
- Recursive Bayesian Networks☆11Updated 8 months ago
- ☆24Updated 5 years ago
- Blog post☆17Updated last year
- ☆59Updated 5 years ago
- ☆21Updated 5 years ago
- Pytorch Implemetation for our NAACL2019 Paper "Riemannian Normalizing Flow on Variational Wasserstein Autoencoder for Text Modeling" http…☆63Updated 5 years ago
- Stochastic Gradient Langevin Dynamics for Bayesian learning☆35Updated 4 years ago
- ☆24Updated 8 months ago
- ☆13Updated 4 years ago
- Learning to Encode Position for Transformer with Continuous Dynamical Model☆59Updated 5 years ago
- code for Explicit Sparse Transformer☆61Updated 2 years ago