Implementation of Universal Transformer in Pytorch
☆266Nov 19, 2018Updated 7 years ago
Alternatives and similar repositories for Universal-Transformer-Pytorch
Users that are interested in Universal-Transformer-Pytorch are comparing it to the libraries listed below
Sorting:
- SemEval 2019 Task 4: Hyperpartisan News Detection☆13Nov 9, 2019Updated 6 years ago
- The bAbI question-answering dataset ported into T2T.☆32Dec 13, 2018Updated 7 years ago
- Hierarchical Attention for Dialogue Emotion Classification (SemEval, NAACL)☆44Jul 6, 2023Updated 2 years ago
- Neutron: A pytorch based implementation of Transformer and its variants.☆64Aug 10, 2023Updated 2 years ago
- Sparse and structured neural attention mechanisms☆225Aug 31, 2020Updated 5 years ago
- Transformer training code for sequential tasks☆609Sep 14, 2021Updated 4 years ago
- An implementation of DeepMind's Relational Recurrent Neural Networks (NeurIPS 2018) in PyTorch.☆246Dec 27, 2018Updated 7 years ago
- Personalizing Dialogue Agents via Meta-Learning☆131Jul 25, 2024Updated last year
- Simple XLNet implementation with Pytorch Wrapper☆580Jul 3, 2019Updated 6 years ago
- Implementation of End-to-End Memory Network in PyTorch☆106Aug 28, 2017Updated 8 years ago
- Neural Text Generation with Unlikelihood Training☆310Aug 31, 2021Updated 4 years ago
- Text Content Manipulation☆45Nov 16, 2020Updated 5 years ago
- PyTorch implementation of latent space reinforcement learning for E2E dialog published at NAACL 2019. It is released by Tiancheng Zhao (…☆144Jun 21, 2019Updated 6 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,924Feb 14, 2023Updated 3 years ago
- Sequence-to-Sequence learning using PyTorch☆521Nov 12, 2019Updated 6 years ago
- A latent-variable model for learning bilingual word embedding mappings☆18Feb 11, 2019Updated 7 years ago
- Easy to use NLP library built on PyTorch and TorchText☆258Dec 7, 2019Updated 6 years ago
- Neural Module Network for Reasoning over Text, ICLR 2020☆120Oct 6, 2020Updated 5 years ago
- Linear-chain LSTM-CRFs and Convolutional CRFs in PyTorch.☆22Aug 11, 2017Updated 8 years ago
- EMNLP-2020: Cross-lingual Spoken Language Understanding with Regularized Representation Alignment☆18Nov 21, 2020Updated 5 years ago
- PyTorch code for ICLR 2019 paper: Global-to-local Memory Pointer Networks for Task-Oriented Dialogue https://arxiv.org/pdf/1901.04713☆160Jul 11, 2019Updated 6 years ago
- Implementation of AlphaZero in PyTorch.☆10Apr 19, 2019Updated 6 years ago
- Implementation of Neural Arithmetic Logic Units (https://arxiv.org/pdf/1808.00508.pdf)☆31Oct 24, 2018Updated 7 years ago
- Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems☆22May 28, 2021Updated 4 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,176May 28, 2023Updated 2 years ago
- Latent Alignment and Variational Attention☆329Nov 5, 2018Updated 7 years ago
- PyTorch implementation of Transformer-based Neural Machine Translation☆78Dec 14, 2022Updated 3 years ago
- A PyTorch implementation of the Transformer model in "Attention is All You Need".☆9,634Apr 16, 2024Updated last year
- Pytorch implementation of R-Transformer. Some parts of the code are adapted from the implementation of TCN and Transformer.☆231Jul 16, 2019Updated 6 years ago
- WMT-2012 shared task on Quality Estimation☆18Sep 5, 2012Updated 13 years ago
- An open source framework for seq2seq models in PyTorch.☆1,517Sep 17, 2025Updated 5 months ago
- tunz's CUDA pytorch operator (MaskedSoftmax)☆75Mar 7, 2019Updated 6 years ago
- Single Headed Attention RNN - "Stop thinking with your head"☆1,180Nov 27, 2021Updated 4 years ago
- Generative Flow based Sequence-to-Sequence Toolkit written in Python.☆247Jan 28, 2020Updated 6 years ago
- Reinforcement Learning for Neural Machine Translation☆189Dec 29, 2024Updated last year
- Source code for "Efficient Training of BERT by Progressively Stacking"☆113Jul 3, 2019Updated 6 years ago
- Fast, general, and tested differentiable structured prediction in PyTorch☆1,123Apr 20, 2022Updated 3 years ago
- ☆19Apr 15, 2022Updated 3 years ago
- Variational Transformers for Diverse Response Generation☆82Jul 25, 2024Updated last year