Gladys-Zhao / mRNN-mLSTMLinks
Code for ICML 2020 paper: Do RNN and LSTM have Long Memory?
☆17Updated 4 years ago
Alternatives and similar repositories for mRNN-mLSTM
Users that are interested in mRNN-mLSTM are comparing it to the libraries listed below
Sorting:
- Python implementation of paper "AntisymmetricRNN: A Dynamical System View on Recurrent Neural Networks"☆15Updated 5 years ago
- [EMNLP'19] Summary for Transformer Understanding☆53Updated 5 years ago
- ☆40Updated 6 years ago
- Recursive Bayesian Networks☆11Updated last month
- Stochastic Gradient Langevin Dynamics for Bayesian learning☆32Updated 3 years ago
- ☆10Updated 3 years ago
- Curse-of-memory phenomenon of RNNs in sequence modelling☆19Updated last month
- ☆33Updated 4 years ago
- STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION☆16Updated 7 years ago
- ☆22Updated 3 years ago
- ☆24Updated 5 years ago
- Implementation of Mogrifier LSTM in PyTorch☆35Updated 5 years ago
- A pytorch implementation for the LSTM experiments in the paper: Why Gradient Clipping Accelerates Training: A Theoretical Justification f…☆46Updated 5 years ago
- Supplementary code for the paper "Meta-Solver for Neural Ordinary Differential Equations" https://arxiv.org/abs/2103.08561☆25Updated 4 years ago
- Gradient-based Hyperparameter Optimization Over Long Horizons☆14Updated 3 years ago
- A quick walk-through of the innards of LSTMs and a naive implementation of the Mogrifier LSTM paper in PyTorch☆77Updated 4 years ago
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10Updated 7 years ago
- ☆29Updated 3 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Updated 3 years ago
- Code base for SRSGD.☆28Updated 5 years ago
- Refining continuous-in-depth neural networks☆40Updated 3 years ago
- An encoder-decoder framework for learning from incomplete data☆45Updated last year
- MTAdam: Automatic Balancing of Multiple Training Loss Terms☆36Updated 4 years ago
- A PyTorch implement of Dilated RNN☆11Updated 7 years ago
- ☆12Updated 5 years ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆17Updated last year
- PyTorch implementation for PaLM: A Hybrid Parser and Language Model.☆10Updated 5 years ago
- ZForcing Repo☆40Updated 7 years ago
- A pytorch implementation of Amortized Stein Variational Gradient Descent/ Stein GAN☆19Updated 6 years ago
- Official code for UnICORNN (ICML 2021)☆27Updated 3 years ago