jadore801120 / attention-is-all-you-need-pytorchLinks

A PyTorch implementation of the Transformer model in "Attention is All You Need".

☆9,313

Alternatives and similar repositories for attention-is-all-you-need-pytorch

Users that are interested in attention-is-all-you-need-pytorch are comparing it to the libraries listed below

Sorting:

codertimo / BERT-pytorch
Google AI 2018 BERT pytorch implementation
☆6,434Updated last year
harvardnlp / annotated-transformer
An annotated implementation of the Transformer paper.
☆6,400Updated last year
Kyubyong / transformer
A TensorFlow Implementation of the Transformer: Attention Is All You Need
☆4,388Updated 2 years ago
hyunwoongko / transformer
Transformer: PyTorch Implementation of "Attention Is All You Need"
☆3,915Updated 2 weeks ago
tensorflow / tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
☆16,348Updated 2 years ago
lanpa / tensorboardX
tensorboard for pytorch (and chainer, mxnet, numpy, ...)
☆7,960Updated last month
kimiyoung / transformer-xl
☆3,664Updated 2 years ago
pytorch / examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
☆23,270Updated last week
SamLynnEvans / Transformer
Transformer seq2seq model, program that can build a language translator from parallel corpus
☆1,401Updated 2 years ago
bentrevett / pytorch-seq2seq
Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.
☆5,589Updated last year
NVIDIA / apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
☆8,745Updated last week
OpenNMT / OpenNMT-py
Open Source Neural Machine Translation and (Large) Language Models in PyTorch
☆6,921Updated 4 months ago
jayparks / transformer
A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"
☆560Updated 4 years ago
tunz / transformer-pytorch
Transformer implementation in PyTorch.
☆493Updated 6 years ago
sksq96 / pytorch-summary
Model summary in PyTorch similar to `model.summary()` in Keras
☆4,050Updated last year
pytorch / text
Models, data loaders and abstractions for language processing, powered by PyTorch
☆3,548Updated this week
pliang279 / awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
☆6,566Updated 11 months ago
jessevig / bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
☆7,580Updated 2 months ago
google-research / vision_transformer
☆11,632Updated 4 months ago
zihangdai / xlnet
XLNet: Generalized Autoregressive Pretraining for Language Understanding
☆6,182Updated 2 years ago
DSKSD / DeepNLP-models-Pytorch
Pytorch implementations of various Deep NLP models in cs-224n(Stanford Univ)
☆2,952Updated 5 years ago
tkipf / pygcn
Graph Convolutional Networks in PyTorch
☆5,326Updated 4 years ago
Cadene / pretrained-models.pytorch
Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.
☆9,100Updated 3 years ago
yunjey / pytorch-tutorial
PyTorch Tutorial for Deep Learning Researchers
☆31,575Updated last year
MorvanZhou / PyTorch-Tutorial
Build your neural network easy and fast, 莫烦Python中文教学
☆8,338Updated 2 years ago
jcjohnson / pytorch-examples
Simple examples to introduce PyTorch
☆4,813Updated 3 years ago
spro / practical-pytorch
Go to https://github.com/pytorch/tutorials - this repo is deprecated and no longer maintained
☆4,549Updated 4 years ago
cs230-stanford / cs230-code-examples
Code examples in pyTorch and Tensorflow for CS230
☆4,066Updated 2 years ago
dkozlov / awesome-knowledge-distillation
Awesome Knowledge Distillation
☆3,716Updated last month
pytorch / vision
Datasets, Transforms and Models specific to Computer Vision
☆17,046Updated this week