cyk1337 / Transformer-in-PyTorchLinks
Transformer/Transformer-XL/R-Transformer examples and explanations
☆26Updated 3 years ago
Alternatives and similar repositories for Transformer-in-PyTorch
Users that are interested in Transformer-in-PyTorch are comparing it to the libraries listed below
Sorting:
- some basic deep learning models/method for nlp, text classification.☆79Updated 5 years ago
- Multi-head attention in PyTorch☆153Updated 6 years ago
- A pytorch implementation of Fairseq Convolutional Sequence to Sequence Learning(Gehring et al. 2017)☆46Updated 6 years ago
- pytorch实现的基于attention is all your need提出的Q,K,V的attention模板和派生的attention实现。☆21Updated 5 years ago
- Pytorch implementation of Neural Machine Translation with seq2seq and attention (en-zh)☆41Updated 6 years ago
- This repository contain various types of attention mechanism like Bahdanau , Soft attention , Additive Attention , Hierarchical Attention…☆125Updated 3 years ago
- ☆167Updated 6 years ago
- basic hmm☆27Updated 7 years ago
- The code of Encoding Word Order in Complex-valued Embedding☆42Updated 6 years ago
- PyTorch implementation of "Effective Approaches to Attention-based Neural Machine Translation" using scheduled sampling to improve the pa…☆38Updated 8 years ago
- This is a repository for Multi-task learning with toy data in Pytorch and Tensorflow☆136Updated 6 years ago
- pytorch implementation of Attention is all you need☆238Updated 4 years ago
- A quick walk-through of the innards of LSTMs and a naive implementation of the Mogrifier LSTM paper in PyTorch☆78Updated 4 years ago
- Simple implement dilated LSTM, residual LSTM and Attention LSTM (follow the corresponding papers).☆17Updated 5 years ago
- Codes for "Learning Sparse Sharing Architectures for Multiple Tasks"☆95Updated 4 years ago
- A pytorch &keras implementation and demo of Fastformer.☆189Updated 2 years ago
- A Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"☆560Updated 4 years ago
- ☆131Updated 4 years ago
- Pytorch implementation of the methods proposed in **Adversarial Training Methods for Semi-Supervised Text Classification** on IMDB datase…☆42Updated 6 years ago
- Implementation of some unbalanced loss like focal_loss, dice_loss, DSC Loss, GHM Loss et.al☆265Updated 2 years ago
- Keras implement of ON-LSTM (Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks)☆157Updated 5 years ago
- 《机器学习之类别不平衡问题》文章代码☆47Updated 6 years ago
- notes☆48Updated last year
- NLP Project + pytorch☆10Updated 4 years ago
- 高斯混合模型(GMM 聚类)的 EM 算法实现。☆200Updated 6 years ago
- star_transformer pytorch☆27Updated 5 years ago
- This in my Demo of Chen et al. "GradNorm: Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks" ICML 2018☆179Updated 3 years ago
- Implement the paper "Self-Attention with Relative Position Representations"☆136Updated 4 years ago
- Pytorch implementation of the GradNorm. GradNorm addresses the problem of balancing multiple losses for multi-task learning by learning a…☆269Updated 3 years ago
- ☆275Updated 3 years ago