dayihengliu / a2m_chineseNMT
Dataset for TALLIP2019 paper "Ancient-Modern Chinese Translation with a New Large Training Dataset"
☆22Updated 2 years ago
Alternatives and similar repositories for a2m_chineseNMT:
Users that are interested in a2m_chineseNMT are comparing it to the libraries listed below
- 古文语言理解测评基准 Classical Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆47Updated last year
- A Multi-tasking and Multi-stage Chinese Minority Pre-Trained Language Model☆10Updated last year
- Code & data for our EMNLP2022 paper "SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser"☆81Updated 11 months ago
- Yet Another Chinese Learner Corpus☆77Updated 3 years ago
- Dynamic Connected Networks for Chinese Spelling Check☆50Updated 10 months ago
- Universal End2End Training Platform, including pre-training, classification tasks, machine translation, and etc.☆45Updated 2 years ago
- Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021☆60Updated 3 years ago
- Code and data of the paper "MCTS: A Multi-Reference Chinese Text Simplification Dataset".☆29Updated 8 months ago
- Code of zlyang's master dissertation for Chinese grammatical error correction.☆34Updated 5 years ago
- ☆55Updated last year
- A grammatical error correction reading list maintained by Beijing Language and Culture University Natural Language Processing Group☆24Updated 4 years ago
- ☆17Updated 7 years ago
- <u><a href="https://circse.github.io/LT4HALA/" style="color: white">Workshop on Language Technologies for Historical and Ancient Language…☆33Updated 8 months ago
- Learning to Rewrite for Non-Autoregressive Neural Machine Translation☆21Updated 3 years ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆59Updated 3 years ago
- ☆32Updated 2 years ago
- The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models☆24Updated 3 years ago
- A grammatical error correction reading list maintained by BLCU ICALL Research Group☆46Updated 2 years ago
- The dataset and the evaluation tool for NLPCC2018 Shared Task2--Grammatical Error Correction (GEC).☆55Updated 2 years ago
- CTC2021-中文文本纠错大赛的SOTA方案及在线演示☆72Updated last year
- TVsub: DCU-Tencent Chinese-English Dialogue Corpus☆46Updated 7 years ago
- This is the official code for paper titled "Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models".☆67Updated 3 years ago
- [EACL'21] Non-Autoregressive with Pretrained Language Model☆62Updated 2 years ago
- ExpMRC: Explainability Evaluation for Machine Reading Comprehension☆62Updated last year
- The repository for the paper: Rethinking Document-level Neural Machine Translation☆25Updated 2 years ago
- Repository of "An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction" (EMNLP-IJCNLP 2019)☆68Updated 5 years ago
- code of our EMNLP-19 Paper, CM-Net: A Novel Collaborative Memory Network for Spoken Language Understanding☆27Updated 5 years ago
- Pre-processing and training scripts for WMT 2017 ZH-EN translation task☆39Updated 4 years ago
- A dataset and baselines for CLS.☆11Updated 2 years ago
- ☆9Updated 10 months ago