Gladys-Zhao / mRNN-mLSTM
Code for ICML 2020 paper: Do RNN and LSTM have Long Memory?
☆17Updated 4 years ago
Alternatives and similar repositories for mRNN-mLSTM
Users that are interested in mRNN-mLSTM are comparing it to the libraries listed below
Sorting:
- Python implementation of paper "AntisymmetricRNN: A Dynamical System View on Recurrent Neural Networks"☆14Updated 5 years ago
- Implementation of Mogrifier LSTM in PyTorch☆35Updated 5 years ago
- ☆40Updated 6 years ago
- Recursive Bayesian Networks☆11Updated 5 months ago
- [EMNLP'19] Summary for Transformer Understanding☆53Updated 5 years ago
- Code for Augment & Reduce, a scalable stochastic algorithm for large categorical distributions☆10Updated 6 years ago
- A PyTorch implement of Dilated RNN☆11Updated 7 years ago
- ☆24Updated 5 years ago
- Stochastic Gradient Langevin Dynamics for Bayesian learning☆31Updated 3 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Updated 3 years ago
- STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION☆16Updated 6 years ago
- Complex domain recurrent neural network gating and Stiefel-manifold optimization in TensorFlow, Neural Information Processing Systems (Ne…☆50Updated 3 years ago
- ☆22Updated 3 years ago
- ☆33Updated 4 years ago
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)☆49Updated 2 years ago
- Curse-of-memory phenomenon of RNNs in sequence modelling☆19Updated this week
- Code for the paper "Disentangled Generative Models for Robust Prediction of System Dynamics"☆14Updated 2 years ago
- ☆27Updated 5 years ago
- ZForcing Repo☆40Updated 7 years ago
- Interpolation between Residual and Non-Residual Networks, ICML 2020. https://arxiv.org/abs/2006.05749☆26Updated 4 years ago
- ☆12Updated 3 years ago
- ☆10Updated 3 years ago
- This repository contains the code used for Ordered Memory paper☆30Updated 5 years ago
- Code for: "Neural Controlled Differential Equations for Online Prediction Tasks"☆38Updated 2 years ago
- Self Supervised Learning for Time Series Using Similarity Distillation☆10Updated 2 years ago
- ☆27Updated 10 months ago
- Code for the paper "Feature Grouping as a Stochastic Regularizer for High-Dimensional Structured Data" at ICML 2019.☆20Updated 6 years ago
- Official repository for the paper "Fast Predictive Uncertainty for Classification with Bayesian Deep Networks". Accepted at UAI 2022. htt…☆12Updated 2 years ago
- PyTorch implementation for PaLM: A Hybrid Parser and Language Model.☆10Updated 5 years ago
- Training quantile models☆43Updated 5 months ago