A PyTorch implementation of Attention is all you need
☆43Oct 16, 2018Updated 7 years ago
Alternatives and similar repositories for Transformer-PyTorch
Users that are interested in Transformer-PyTorch are comparing it to the libraries listed below
Sorting:
- An example of DyNet autobatching for the NIPS "how to code a paper" workshop☆12Dec 9, 2017Updated 8 years ago
- ☆13Aug 20, 2021Updated 4 years ago
- Parallel SGD, done locally and remote☆14May 19, 2016Updated 9 years ago
- A Python wrapper for the ROUGE summarization evaluation package☆14Aug 9, 2017Updated 8 years ago
- Implementation of the attention-sum reader using tensorflow and keras.☆11Aug 1, 2017Updated 8 years ago
- ☆25May 21, 2018Updated 7 years ago
- ☆14Dec 7, 2022Updated 3 years ago
- Pre-processing and training scripts for WMT 2017 ZH-EN translation task☆40Jun 7, 2020Updated 5 years ago
- Python wrapper for evaluating summarization quality by ROUGE package☆162May 25, 2020Updated 5 years ago
- ☆16Apr 11, 2022Updated 3 years ago
- 多语言降噪预训练模型MBart的中文生成任务☆11May 27, 2021Updated 4 years ago
- ☆12Dec 8, 2022Updated 3 years ago
- PyTorch implementation of Transformer-based Neural Machine Translation☆78Dec 14, 2022Updated 3 years ago
- Semantic Parser with Execution☆28Sep 10, 2018Updated 7 years ago
- ☆17Oct 9, 2022Updated 3 years ago
- Tools for working with the S800 corpus☆12Sep 17, 2020Updated 5 years ago
- Code for the paper "Extreme Adaptation for Personalized Neural Machine Translation"☆42Sep 22, 2025Updated 5 months ago
- ☆10Nov 15, 2020Updated 5 years ago
- 야자타임 (a.k.a. 야밤의 자연어처리 타임)☆27Mar 31, 2021Updated 4 years ago
- Library for implementing RNNs with Theano☆11Mar 26, 2015Updated 10 years ago
- ☆11Oct 3, 2021Updated 4 years ago
- 针对常见的BAT公司中的大数据面试和笔试问题,列出解决思路,并使用python来实现☆11Aug 17, 2015Updated 10 years ago
- Code and dataset for "Leveraging 2-hop Distant Supervision from Table Entity Pairs for Relation Extraction" (EMNLP'19)☆13May 18, 2020Updated 5 years ago
- Distinguishing Antonyms and Synonyms in a Pattern-based Neural Network☆16Feb 24, 2017Updated 9 years ago
- Here, I provided the solution for exercises of IBM Quantum Challenge 2020☆10Oct 27, 2020Updated 5 years ago
- Sequence-Level Mixed Sample Data Augmentation☆23Mar 7, 2021Updated 5 years ago
- An implementation of an autoregressive language model using an improved Transformer and DeepSpeed pipeline parallelism.☆30Jan 12, 2026Updated 2 months ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Mar 6, 2023Updated 3 years ago
- Code and data for automatic paraphrase dataset augmentation.☆11Mar 8, 2021Updated 5 years ago
- Code inspired by Unsupervised Machine Translation Using Monolingual Corpora Only☆50Jul 25, 2024Updated last year
- Implementation of a dependency parser using neural networks☆11Mar 7, 2017Updated 9 years ago
- Top-down Tree LSTM (NAACL 2016) http://aclweb.org/anthology/N/N16/N16-1035.pdf☆83Nov 29, 2016Updated 9 years ago
- ☆182Aug 17, 2018Updated 7 years ago
- Text Style Transfer: A Review☆13Jun 1, 2019Updated 6 years ago
- N/A☆18Aug 15, 2022Updated 3 years ago
- Locality-Sensitive Bloom Filter for Approximate Membership Query