THUNLP-MT / THUCC
An open-source classical Chinese information processing toolkit developed by Tsinghua Natural Language Processing Group
☆51Updated 6 years ago
Alternatives and similar repositories for THUCC:
Users that are interested in THUCC are comparing it to the libraries listed below
- Joint Embeddings of Chinese Words, Characters, and Fine-grained Subcharacter Components☆99Updated 5 years ago
- A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)☆170Updated 6 years ago
- Codes for Lexical Sememe Prediction via Word Embeddings and Matrix Factorization (IJCAI 2017).☆60Updated 5 years ago
- An open-source neural machine translation system developed by Natural Language Processing Group, Nanjing University.☆103Updated 6 years ago
- Interpoetry: Generating Classical Chinese Poems from Vernacular Chinese.☆42Updated 5 years ago
- Improved Word Representation Learning with Sememes☆197Updated 6 years ago
- Code of EMNLP paper: http://aclweb.org/anthology/D18-1531☆62Updated 6 years ago
- Revised Version of SAT Model in "Improved Word Representation Learning with Sememes"☆49Updated 4 years ago
- Python scripts preprocessing Penn Treebank and Chinese Treebank☆162Updated 4 years ago
- The repository for the paper: Rethinking Document-level Neural Machine Translation☆25Updated 2 years ago
- Investigating Prior Knowledge for Challenging Chinese Machine Reading Comprehension☆166Updated 3 years ago
- Improving Machine Reading Comprehension with General Reading Strategies☆37Updated 6 years ago
- ☆28Updated 6 years ago
- Cross-Lingual Machine Reading Comprehension (EMNLP 2019)☆68Updated 5 years ago
- The source codes of Working Memory model for Chinese poetry generation (IJCAI 2018).☆57Updated 4 years ago
- The First Evaluation Workshop on Chinese Machine Reading Comprehension (CMRC 2017)☆91Updated 5 years ago
- machine translation and quality estimation☆34Updated 6 years ago
- SemEval-2016 Task 9: Chinese Semantic Dependency Parsing☆135Updated 7 years ago
- Collections of Chinese reading comprehension datasets☆217Updated 5 years ago
- Document-Level Neural Machine Translation with Hierarchical Attention Networks☆67Updated 2 years ago
- ☆78Updated 6 years ago
- Dataset for TALLIP2019 paper "Ancient-Modern Chinese Translation with a New Large Training Dataset"☆23Updated 2 years ago
- Prior Knowledge Integration for Neural Machine Translation using Posterior Regularization☆11Updated 6 years ago
- Code accompanying Incorporating Chinese Characters of Words for Lexical Sememe Prediction (ACL2018) https://arxiv.org/abs/1806.06349☆25Updated 6 years ago
- The dataset and the evaluation tool for NLPCC2018 Shared Task2--Grammatical Error Correction (GEC).☆55Updated 3 years ago
- Re-rank n-best lists using additional features.☆28Updated 6 years ago
- A Joint Chinese segmentation and POS tagger based on bidirectional GRU-CRF☆153Updated 7 years ago
- This is the repository for NLPCC2020 task AutoIE☆51Updated 4 years ago
- Source code and data for ACL 2019 paper "Modeling Semantic Compositionality with Sememe Knowledge"☆35Updated 4 years ago
- BiAffine Dependency Parsing☆53Updated 6 years ago