jayparks / transformer
A PyTorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"
☆551 Updated 4 years ago
Alternatives and similar repositories for transformer:
Users who are interested in transformer compare it to the libraries listed below.
- pytorch implementation of Attention is all you need ☆239 Updated 3 years ago
- Minimal Seq2Seq model with Attention for Neural Machine Translation in PyTorch ☆695 Updated 4 years ago
- Transformer implementation in PyTorch. ☆477 Updated 5 years ago
- some attention implements ☆1,440 Updated 5 years ago
- PyTorch implementation of batched bi-RNN encoder and attention-decoder. ☆279 Updated 6 years ago
- Transformer seq2seq model, program that can build a language translator from parallel corpus ☆1,373 Updated last year
- [ICLR 2020] Lite Transformer with Long-Short Range Attention ☆606 Updated 7 months ago
- Pytorch Implementation of Google BERT ☆593 Updated 4 years ago
- PyTorch implementation of beam search decoding for seq2seq models ☆338 Updated 2 years ago
- Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks" ☆577 Updated 5 years ago
- ☆166 Updated 5 years ago
- Natural Language Processing Tutorial for Deep Learning Researchers ☆1,101 Updated 2 years ago
- ☆872 Updated 8 months ago
- Pytorch implementation of R-Transformer. Some parts of the code are adapted from the implementation of TCN and Transformer. ☆227 Updated 5 years ago
- A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.) ☆643 Updated 6 years ago
- ☆3,635 Updated 2 years ago
- Reformer, the efficient Transformer, in Pytorch ☆2,152 Updated last year
- list of efficient attention modules ☆995 Updated 3 years ago
- LSTM-CRF in PyTorch ☆459 Updated 6 months ago
- A Structured Self-attentive Sentence Embedding ☆492 Updated 5 years ago
- CNNs for Sentence Classification in PyTorch ☆1,024 Updated 3 weeks ago
- Hierarchical Attention Networks for Document Classification in PyTorch ☆605 Updated 4 years ago
- Hierarchical Attention Networks for document classification ☆396 Updated 3 years ago
- star_transformer pytorch ☆27 Updated 5 years ago
- (Linear-chain) Conditional random field in PyTorch. ☆957 Updated 8 months ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers" ☆1,550 Updated 4 years ago
- Multi-head attention in PyTorch ☆150 Updated 5 years ago
- ☆230 Updated 5 years ago
- ☆264 Updated 2 years ago
- PyTorch Re-Implementation of "Generating Sentences from a Continuous Space" by Bowman et al 2015 https://arxiv.org/abs/1511.06349 ☆590 Updated last month