THUNLP-MT / THUMT
An open-source neural machine translation toolkit developed by Tsinghua Natural Language Processing Group
☆707Updated 3 years ago
Alternatives and similar repositories for THUMT:
Users that are interested in THUMT are comparing it to the libraries listed below
- MASS: Masked Sequence to Sequence Pre-training for Language Generation☆1,119Updated 2 years ago
- Baseline Systems of DuReader Dataset☆1,144Updated 2 years ago
- This repo contains our ACL 2017 paper data and source code☆724Updated 4 years ago
- Four word embedding models implemented in Python. Supporting arbitrary context features☆850Updated 5 years ago
- A datasets and methods survey about task-oriented dialogue, including recent datasets and SOTA leaderboards.☆1,246Updated 2 years ago
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generation☆2,232Updated 9 months ago
- A machine translation reading list maintained by Tsinghua Natural Language Processing Group☆2,443Updated 8 months ago
- NMT for chinese-english using fairseq☆214Updated 7 years ago
- Empower Sequence Labeling with Task-Aware Language Model☆847Updated 2 years ago
- Simple, fast unsupervised word aligner☆752Updated 2 years ago
- Must-read papers on Machine Reading Comprehension☆893Updated 4 years ago
- Pre-Trained Chinese XLNet(中文XLNet预训练模型)☆1,649Updated 2 years ago
- Named Entity Recognition for Chinese social media (Weibo). From EMNLP 2015 paper.☆550Updated 4 years ago
- Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard☆1,781Updated 2 years ago
- Deep Semantic Role Labeling with Self-Attention☆307Updated 5 years ago
- Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities"☆1,415Updated last year
- The score code of FastBERT (ACL2020)☆605Updated 3 years ago
- bert nlp papers, applications and github resources, including the newst xlnet , BERT、XLNet 相关论文和 github 项目☆1,848Updated 4 years ago
- Open-Source Neural Machine Translation in Tensorflow☆797Updated 2 years ago
- Chinese NER using Lattice LSTM. Code for ACL 2018 paper.☆1,813Updated 6 years ago
- cw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information☆272Updated 2 years ago
- ☆363Updated 2 years ago
- Pre-trained Chinese ELECTRA(中文ELECTRA预训练模型)☆1,418Updated 2 years ago
- 高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型☆815Updated 4 years ago
- Tensorflow implementation of contextualized word representations from bi-directional language models☆1,619Updated 2 years ago
- A Tensorflow implementation of QANet for machine reading comprehension☆981Updated 6 years ago
- ☆442Updated 2 years ago
- BERT for Multitask Learning☆547Updated 2 years ago
- Use Google's BERT for named entity recognition (CoNLL-2003 as the dataset).☆1,259Updated 2 years ago
- Simple Solution for Multi-Criteria Chinese Word Segmentation☆302Updated 4 years ago