SamLynnEvans / Transformer
Transformer seq2seq model, program that can build a language translator from parallel corpus
☆1,395Updated last year
Alternatives and similar repositories for Transformer
Users that are interested in Transformer are comparing it to the libraries listed below
Sorting:
- ☆3,646Updated 2 years ago
- Google AI 2018 BERT pytorch implementation☆6,391Updated last year
- A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"☆556Updated 4 years ago
- A TensorFlow Implementation of the Transformer: Attention Is All You Need☆4,364Updated last year
- A PyTorch implementation of the Transformer model in "Attention is All You Need".☆9,177Updated last year
- Longformer: The Long-Document Transformer☆2,120Updated 2 years ago
- Reformer, the efficient Transformer, in Pytorch☆2,168Updated last year
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"☆1,571Updated 4 years ago
- An annotated implementation of the Transformer paper.☆6,220Updated last year
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generation☆2,232Updated 9 months ago
- Transformer implementation in PyTorch.☆486Updated 6 years ago
- Multi-Task Deep Neural Networks for Natural Language Understanding☆2,251Updated last year
- Pytorch Implementation of Google BERT☆593Updated 5 years ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,554Updated 7 months ago
- Code and model for the paper "Improving Language Understanding by Generative Pre-Training"☆2,208Updated 6 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,906Updated 2 years ago
- 🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI☆1,509Updated 3 years ago
- Transformer: PyTorch Implementation of "Attention Is All You Need"☆3,692Updated 9 months ago
- Pytorch library for fast transformer implementations☆1,703Updated 2 years ago
- Minimal Seq2Seq model with Attention for Neural Machine Translation in PyTorch☆699Updated 4 years ago
- Open Source Neural Machine Translation and (Large) Language Models in PyTorch☆6,871Updated 2 months ago
- MASS: Masked Sequence to Sequence Pre-training for Language Generation☆1,119Updated 2 years ago
- Must-read Papers on pre-trained language models.☆3,358Updated 2 years ago
- A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)☆646Updated 6 years ago
- [DEPRECATED] Repo for exploring multi-task learning approaches to learning sentence representations☆794Updated 3 years ago
- Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.☆5,546Updated last year
- Simple XLNet implementation with Pytorch Wrapper☆581Updated 5 years ago
- Single Headed Attention RNN - "Stop thinking with your head"☆1,183Updated 3 years ago
- A fast MoE impl for PyTorch☆1,720Updated 3 months ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators☆2,353Updated last year