cyk1337 / Transformer-in-PyTorch

Transformer/Transformer-XL/R-Transformer examples and explanations

☆26

Alternatives and similar repositories for Transformer-in-PyTorch

Users who are interested in Transformer-in-PyTorch are comparing it to the libraries listed below.

zy1996code / nlp_basic_model
Some basic deep learning models and methods for NLP text classification.
☆79Updated 5 years ago
CyberZHG / torch-multi-head-attention
Multi-head attention in PyTorch
☆153Updated 6 years ago
pengshuang / CNN-Seq2Seq
A PyTorch implementation of Fairseq Convolutional Sequence to Sequence Learning (Gehring et al. 2017)
☆46Updated 6 years ago
sakuranew / attention-pytorch
A PyTorch implementation of the Q, K, V attention template proposed in "Attention Is All You Need", along with derived attention implementations.
☆21Updated 5 years ago
jasperzhong / NMT
Pytorch implementation of Neural Machine Translation with seq2seq and attention (en-zh)
☆41Updated 6 years ago
monk1337 / Various-Attention-mechanisms
This repository contains various types of attention mechanisms, such as Bahdanau, Soft Attention, Additive Attention, Hierarchical Attention…
☆125Updated 3 years ago
u784799i / biLSTM_attn
☆167Updated 6 years ago
Continue7777 / HMM
basic hmm
☆27Updated 7 years ago
zhaodongh / Encoding-Word-Order-in-Complex-valued-Embedding
The code of Encoding Word Order in Complex-valued Embedding
☆42Updated 6 years ago
lingyongyan / Neural-Machine-Translation
PyTorch implementation of "Effective Approaches to Attention-based Neural Machine Translation" using scheduled sampling to improve the pa…
☆38Updated 8 years ago
hosseinshn / Basic-Multi-task-Learning
This is a repository for Multi-task learning with toy data in Pytorch and Tensorflow
☆136Updated 6 years ago
leviswind / pytorch-transformer
pytorch implementation of Attention is all you need
☆238Updated 4 years ago
RMichaelSwan / MogrifierLSTM
A quick walk-through of the innards of LSTMs and a naive implementation of the Mogrifier LSTM paper in PyTorch
☆78Updated 4 years ago
trongthanhptnk / Dilated_Res_Attention_LSTM
Simple implementations of dilated LSTM, residual LSTM, and attention LSTM (following the corresponding papers).
☆17Updated 5 years ago
choosewhatulike / sparse-sharing
Codes for "Learning Sparse Sharing Architectures for Multiple Tasks"
☆95Updated 4 years ago
wuch15 / Fastformer
A PyTorch & Keras implementation and demo of Fastformer.
☆189Updated 2 years ago
jayparks / transformer
A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"
☆560Updated 4 years ago
uhauha2929 / examples
☆131Updated 4 years ago
WangJiuniu / adversarial_training
PyTorch implementation of the methods proposed in "Adversarial Training Methods for Semi-Supervised Text Classification" on IMDB datase…
☆42Updated 6 years ago
xinyi-code / NLP-Loss-Pytorch
Implementations of some losses for imbalanced data, such as focal_loss, dice_loss, DSC Loss, GHM Loss, etc.
☆265Updated 2 years ago
bojone / on-lstm
Keras implementation of ON-LSTM (Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks)
☆157Updated 5 years ago
massquantity / Class-Imbalance
Code for the article "The Class Imbalance Problem in Machine Learning"
☆47Updated 6 years ago
keithyin / mynotes
notes
☆48Updated last year
douzujun / NLP-Project
NLP Project + pytorch
☆10Updated 4 years ago
wrayzheng / gmm-em-clustering
An implementation of the EM algorithm for Gaussian mixture models (GMM clustering).
☆200Updated 6 years ago
liujiarik / nlp_clip_pytorch
star_transformer pytorch
☆27Updated 5 years ago
hosseinshn / GradNorm
This is my demo of Chen et al. "GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks" ICML 2018
☆179Updated 3 years ago
evelinehong / Transformer_Relative_Position_PyTorch
Implement the paper "Self-Attention with Relative Position Representations"
☆136Updated 4 years ago
brianlan / pytorch-grad-norm
Pytorch implementation of the GradNorm. GradNorm addresses the problem of balancing multiple losses for multi-task learning by learning a…
☆269Updated 3 years ago
albertwy / BiLSTM
☆275Updated 3 years ago