tunz / transformer-pytorchLinks

Transformer implementation in PyTorch.

☆490

Alternatives and similar repositories for transformer-pytorch

Users that are interested in transformer-pytorch are comparing it to the libraries listed below

Sorting:

jayparks / transformer
A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"
☆566Updated 5 years ago
sooftware / attentions
PyTorch implementation of some attentions for Deep Learning Researchers.
☆547Updated 3 years ago
SamLynnEvans / Transformer
Transformer seq2seq model, program that can build a language translator from parallel corpus
☆1,419Updated 2 years ago
hyunwoongko / transformer
Transformer: PyTorch Implementation of "Attention Is All You Need"
☆4,252Updated 4 months ago
tatp22 / linformer-pytorch
My take on a practical implementation of Linformer for Pytorch.
☆421Updated 3 years ago
evelinehong / Transformer_Relative_Position_PyTorch
Implement the paper "Self-Attention with Relative Position Representations"
☆139Updated 4 years ago
dhlee347 / pytorchic-bert
Pytorch Implementation of Google BERT
☆597Updated 5 years ago
lucidrains / reformer-pytorch
Reformer, the efficient Transformer, in Pytorch
☆2,185Updated 2 years ago
emadRad / lstm-gru-pytorch
LSTM and GRU in PyTorch
☆265Updated 6 years ago
CyberZHG / torch-multi-head-attention
Multi-head attention in PyTorch
☆154Updated 6 years ago
lucidrains / performer-pytorch
An implementation of Performer, a linear attention-based transformer, in Pytorch
☆1,162Updated 3 years ago
lucidrains / mlp-mixer-pytorch
An All-MLP solution for Vision, from Google AI
☆1,053Updated 4 months ago
lilianweng / transformer-tensorflow
Implementation of Transformer Model in Tensorflow
☆477Updated 2 years ago
lucidrains / linear-attention-transformer
Transformer based on a variant of attention that is linear complexity in respect to sequence length
☆812Updated last year
Separius / awesome-fast-attention
list of efficient attention modules
☆1,018Updated 4 years ago
idiap / fast-transformers
Pytorch library for fast transformer implementations
☆1,752Updated 2 years ago
budzianowski / PyTorch-Beam-Search-Decoding
PyTorch implementation of beam search decoding for seq2seq models
☆340Updated 2 years ago
keon / seq2seq
Minimal Seq2Seq model with Attention for Neural Machine Translation in PyTorch
☆701Updated 4 years ago
davidmrau / mixture-of-experts
PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538
☆1,206Updated last year
mit-han-lab / lite-transformer
[ICLR 2020] Lite Transformer with Long-Short Range Attention
☆610Updated last year
dropreg / R-Drop
☆883Updated last year
lucidrains / FLASH-pytorch
Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"
☆370Updated 2 years ago
wuch15 / Fastformer
A pytorch &keras implementation and demo of Fastformer.
☆190Updated 3 years ago
leviswind / pytorch-transformer
pytorch implementation of Attention is all you need
☆239Updated 4 years ago
yaohungt / Multimodal-Transformer
[ACL'19] [PyTorch] Multimodal Transformer
☆930Updated 3 years ago
suragnair / seqGAN
A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)
☆649Updated 7 years ago
dreamgonfly / transformer-pytorch
A PyTorch implementation of Transformer in "Attention is All You Need"
☆106Updated 4 years ago
jadore801120 / attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
☆9,505Updated last year
kimiyoung / transformer-xl
☆3,678Updated 3 years ago
guocheng2025 / Transformer-Encoder
Implementation of Transformer encoder in PyTorch
☆70Updated 5 years ago