asappresearch / sru
Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)
☆2,098 Updated 3 years ago
Alternatives and similar repositories for sru
Users who are interested in sru are comparing it to the libraries listed below.
- PyTorch implementation of the Quasi-Recurrent Neural Network - up to 16 times faster than NVIDIA's cuDNN LSTM ☆1,261 Updated 3 years ago
- Deep learning with dynamic computation graphs in TensorFlow ☆1,823 Updated 3 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit ☆3,740 Updated 3 years ago
- An open source framework for seq2seq models in PyTorch. ☆1,509 Updated last month
- LSTM and QRNN Language Model Toolkit for PyTorch ☆1,974 Updated 3 years ago
- Sequence-to-sequence model with LSTM encoder/decoders and attention ☆1,274 Updated 4 years ago
- Sequence to Sequence Learning with Keras ☆3,175 Updated 2 years ago
- Sequence to Sequence Models with PyTorch ☆739 Updated 3 years ago
- Framework for building complex recurrent neural networks with Keras ☆765 Updated 2 years ago
- Dynamic seq2seq in TensorFlow, step by step ☆996 Updated 7 years ago
- InferSent sentence embeddings ☆2,284 Updated 3 years ago
- Sent2Vec encoder and training code from the paper "Skip-Thought Vectors" ☆2,051 Updated 5 years ago
- Single Headed Attention RNN - "Stop thinking with your head" ☆1,182 Updated 3 years ago
- Memory Networks implementations ☆1,754 Updated 4 years ago
- Phrase-Based & Neural Unsupervised Machine Translation ☆1,504 Updated 3 years ago
- Sequence-to-Sequence learning using PyTorch ☆521 Updated 5 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining. ☆2,912 Updated 2 years ago
- A general-purpose encoder-decoder framework for Tensorflow ☆5,618 Updated 4 years ago
- An optimizer that trains as fast as Adam and as good as SGD. ☆2,917 Updated last year
- Implementation of Sequence Generative Adversarial Nets with Policy Gradient ☆2,092 Updated 6 years ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers" ☆1,577 Updated 4 years ago
- Tutorials and implementations for "Self-normalizing networks" ☆1,586 Updated 3 years ago
- Various tutorials given for welcoming new students at MILA. ☆985 Updated 6 years ago
- Visualizing RNNs using the attention mechanism ☆750 Updated 5 years ago
- Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granul… ☆1,536 Updated 2 years ago
- TensorFlow implementation of Independently Recurrent Neural Networks ☆513 Updated 3 years ago
- Unsupervised Language Modeling at scale for robust sentiment classification ☆1,063 Updated 4 years ago
- "End-To-End Memory Networks" in Tensorflow ☆827 Updated 8 years ago
- Lingvo ☆2,843 Updated this week
- Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequence Prediction ☆502 Updated 4 years ago