THUNLP-MT / THUCCLinks
An open-source classical Chinese information processing toolkit developed by Tsinghua Natural Language Processing Group
☆51Updated 6 years ago
Alternatives and similar repositories for THUCC
Users that are interested in THUCC are comparing it to the libraries listed below
Sorting:
- Interpoetry: Generating Classical Chinese Poems from Vernacular Chinese.☆43Updated 5 years ago
- The source codes of Working Memory model for Chinese poetry generation (IJCAI 2018).☆57Updated 5 years ago
- Python scripts preprocessing Penn Treebank and Chinese Treebank☆161Updated 5 years ago
- An open-source neural machine translation system developed by Natural Language Processing Group, Nanjing University.☆103Updated 7 years ago
- A paper list of automatic poetry generation, analysis, translation, etc.☆186Updated 4 years ago
- Codes for Lexical Sememe Prediction via Word Embeddings and Matrix Factorization (IJCAI 2017).☆61Updated 5 years ago
- Joint Embeddings of Chinese Words, Characters, and Fine-grained Subcharacter Components☆100Updated 6 years ago
- ChID: A Large-scale Chinese IDiom Dataset for Cloze Test☆151Updated 2 years ago
- Improved Word Representation Learning with Sememes☆198Updated 7 years ago
- A Chinese Cloze-style RC Dataset: People's Daily & Children's Fairy Tale (CFT)☆172Updated 6 years ago
- SemEval-2016 Task 9: Chinese Semantic Dependency Parsing☆138Updated 7 years ago
- Code of EMNLP paper: http://aclweb.org/anthology/D18-1531☆62Updated 6 years ago
- machine translation and quality estimation☆34Updated 6 years ago
- BERT-CCPoem is an BERT-based pre-trained model particularly for Chinese classical poetry☆156Updated 3 years ago
- Must-read Papers on Sememe Computation☆198Updated 2 years ago
- Baseline models, training scripts, and instructions on how to reproduce our results for our state-of-art grammar correction system from M…☆73Updated 6 years ago
- Revised Version of SAT Model in "Improved Word Representation Learning with Sememes"☆50Updated 5 years ago
- ☆96Updated last month
- Chinese GPT2: pre-training and fine-tuning framework for text generation☆187Updated 4 years ago
- Python version of the evaluation script from CoNLL'00-☆93Updated 4 years ago
- The source code of ACL 2018 paper "Denoising Distantly Supervised Open-Domain Question Answering".☆206Updated 6 years ago
- Collections of Chinese reading comprehension datasets☆220Updated 5 years ago
- Poetry-related datasets developed by THUAIPoet (Jiuge) group.☆230Updated 5 years ago
- A Joint Chinese segmentation and POS tagger based on bidirectional GRU-CRF☆154Updated 7 years ago
- ☆87Updated 7 years ago
- Pre-processing and training scripts for WMT 2017 ZH-EN translation task☆39Updated 5 years ago
- MAsked Sequence to Sequence (MASS) pre-training for language generation☆21Updated 6 years ago
- Code for Synchronous Bidirectional Neural Machine Translation (SB-NMT)☆66Updated 6 years ago
- LSTM-based dependency graph parser with Bi-LSTM Subtraction and Incremental Tree-LSTM☆28Updated 7 years ago
- NMT for chinese-english using tensor2tensor☆47Updated 7 years ago