THUNLP-MT / THUMT
An open-source neural machine translation toolkit developed by Tsinghua Natural Language Processing Group
☆705Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for THUMT
- MASS: Masked Sequence to Sequence Pre-training for Language Generation☆1,118Updated last year
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generation☆2,197Updated 3 months ago
- Simple, fast unsupervised word aligner☆738Updated 2 years ago
- Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"☆1,412Updated 10 months ago
- NMT for chinese-english using fairseq☆211Updated 7 years ago
- Open-Source Neural Machine Translation in Tensorflow☆797Updated last year
- This repo contains our ACL 2017 paper data and source code☆721Updated 4 years ago
- Must-read papers on Machine Reading Comprehension☆893Updated 4 years ago
- Four word embedding models implemented in Python. Supporting arbitrary context features☆846Updated 5 years ago
- ☆360Updated last year
- bert nlp papers, applications and github resources, including the newst xlnet , BERT、XLNet 相关论文和 github 项目☆1,850Updated 3 years ago
- Moses, the machine translation system☆1,583Updated 5 months ago
- Neural machine translation and sequence learning using TensorFlow☆1,458Updated last year
- A datasets and methods survey about task-oriented dialogue, including recent datasets and SOTA leaderboards.☆1,243Updated 2 years ago
- BERT for Multitask Learning☆546Updated last year
- NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character …☆1,890Updated 2 years ago
- A machine translation reading list maintained by Tsinghua Natural Language Processing Group☆2,432Updated 3 months ago
- Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN☆959Updated 5 years ago
- Baseline Systems of DuReader Dataset☆1,135Updated 2 years ago
- Pre-trained ELMo Representations for Many Languages☆1,463Updated 3 years ago
- Tensorflow Implementation of R-Net☆578Updated 6 years ago
- Pre-Trained Chinese XLNet(中文XLNet预训练模型)☆1,653Updated last year
- The score code of FastBERT (ACL2020)☆604Updated 3 years ago
- Use Google's BERT for named entity recognition (CoNLL-2003 as the dataset).☆1,246Updated 2 years ago
- Evaluating Cross-lingual Sentence Representations☆442Updated 3 years ago
- An open-source neural machine translation system developed by Natural Language Processing Group, Nanjing University.☆99Updated 6 years ago
- BERT as language model, fork from https://github.com/google-research/bert☆247Updated 8 months ago
- Tensorflow implementation of contextualized word representations from bi-directional language models☆1,620Updated last year
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,003Updated 6 months ago
- Named Entity Recognition for Chinese social media (Weibo). From EMNLP 2015 paper.☆546Updated 4 years ago