Kyubyong / transformerLinks

A TensorFlow Implementation of the Transformer: Attention Is All You Need

☆4,388

Alternatives and similar repositories for transformer

Users that are interested in transformer are comparing it to the libraries listed below

Sorting:

kimiyoung / transformer-xl
☆3,664Updated 2 years ago
jadore801120 / attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
☆9,313Updated last year
codertimo / BERT-pytorch
Google AI 2018 BERT pytorch implementation
☆6,434Updated last year
bojone / attention
some attention implements
☆1,445Updated 5 years ago
IBM / pytorch-seq2seq
An open source framework for seq2seq models in PyTorch.
☆1,511Updated 2 months ago
rsennrich / subword-nmt
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
☆2,244Updated 11 months ago
OpenNMT / OpenNMT-py
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
☆6,921Updated 4 months ago
openai / finetune-transformer-lm
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
☆2,221Updated 6 years ago
jayparks / transformer
A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"
☆559Updated 4 years ago
harvardnlp / annotated-transformer
An annotated implementation of the Transformer paper.
☆6,388Updated last year
tensorflow / tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
☆16,348Updated 2 years ago
DSKSD / DeepNLP-models-Pytorch
Pytorch implementations of various Deep NLP models in cs-224n(Stanford Univ)
☆2,952Updated 5 years ago
SamLynnEvans / Transformer
Transformer seq2seq model, program that can build a language translator from parallel corpus
☆1,401Updated 2 years ago
LantaoYu / SeqGAN
Implementation of Sequence Generative Adversarial Nets with Policy Gradient
☆2,092Updated 6 years ago
zihangdai / xlnet
XLNet: Generalized Autoregressive Pretraining for Language Understanding
☆6,182Updated 2 years ago
philipperemy / keras-attention
Keras Attention Layer (Luong and Bahdanau scores).
☆2,812Updated last year
tensorflow / nmt
TensorFlow Neural Machine Translation Tutorial
☆6,436Updated 2 years ago
salesforce / awd-lstm-lm
LSTM and QRNN Language Model Toolkit for PyTorch
☆1,974Updated 3 years ago
allenai / bi-att-flow
Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granul…
☆1,538Updated 2 years ago
huawei-noah / Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
☆3,118Updated last year
THUNLP-MT / MT-Reading-List
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
☆2,443Updated 11 months ago
abisee / pointer-generator
Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"
☆2,190Updated 3 years ago
huggingface / pytorch-openai-transformer-lm
🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI
☆1,511Updated 3 years ago
CyberZHG / keras-bert
Implementation of BERT that could load official pre-trained models for feature extraction and prediction
☆2,426Updated 3 years ago
lilianweng / transformer-tensorflow
Implementation of Transformer Model in Tensorflow
☆472Updated 2 years ago
spro / practical-pytorch
Go to https://github.com/pytorch/tutorials - this repo is deprecated and no longer maintained
☆4,549Updated 4 years ago
lsdefine / attention-is-all-you-need-keras
A Keras+TensorFlow Implementation of the Transformer: Attention Is All You Need
☆711Updated 3 years ago
ilivans / tf-rnn-attention
Tensorflow implementation of attention mechanism for text classification tasks.
☆748Updated 5 years ago
keon / seq2seq
Minimal Seq2Seq model with Attention for Neural Machine Translation in PyTorch
☆703Updated 4 years ago
namisan / mt-dnn
Multi-Task Deep Neural Networks for Natural Language Understanding
☆2,253Updated last year