Gladys-Zhao / mRNN-mLSTMLinks
Code for ICML 2020 paper: Do RNN and LSTM have Long Memory?
☆17Updated 4 years ago
Alternatives and similar repositories for mRNN-mLSTM
Users that are interested in mRNN-mLSTM are comparing it to the libraries listed below
Sorting:
- Python implementation of paper "AntisymmetricRNN: A Dynamical System View on Recurrent Neural Networks"☆15Updated 6 years ago
- ☆40Updated 7 years ago
- [EMNLP'19] Summary for Transformer Understanding☆53Updated 5 years ago
- Implementation of Mogrifier LSTM in PyTorch☆34Updated 5 years ago
- A quick walk-through of the innards of LSTMs and a naive implementation of the Mogrifier LSTM paper in PyTorch☆78Updated 5 years ago
- ☆33Updated 4 years ago
- Curse-of-memory phenomenon of RNNs in sequence modelling☆19Updated 3 months ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Updated 3 years ago
- Spectral Attention Autoregressive Model (SAAM)☆16Updated 2 years ago
- ☆12Updated 3 years ago
- ☆24Updated 5 years ago
- ☆24Updated 3 months ago
- STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION☆16Updated 7 years ago
- ☆22Updated 3 years ago
- Repository for Beyond Pinball Loss: Quantile Methods for Calibrated Uncertainty Quantification (NeurIPS 2024)☆43Updated 9 months ago
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10Updated 7 years ago
- ☆10Updated 3 years ago
- ☆23Updated 11 months ago
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆15Updated 7 months ago
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)☆49Updated 2 months ago
- ☆20Updated 2 years ago
- This is the public github for our paper "Transformer with a Mixture of Gaussian Keys"☆28Updated 3 years ago
- An encoder-decoder framework for learning from incomplete data☆44Updated 2 years ago
- Code used for the AAAI 2020 paper "System Identification with Time-Aware Neural Sequence Models"☆17Updated 5 years ago
- Learning to Encode Position for Transformer with Continuous Dynamical Model☆60Updated 5 years ago
- A straightforward implementation of EGBM-based Generalized Additive Model☆13Updated 4 years ago
- Code for: "Neural Controlled Differential Equations for Online Prediction Tasks"☆38Updated 2 years ago
- Blog post☆17Updated last year
- Supplementary code for the paper "Meta-Solver for Neural Ordinary Differential Equations" https://arxiv.org/abs/2103.08561☆25Updated 4 years ago
- ☆15Updated 3 years ago