jiaohuix / nmt_data_tools
Machine translation data processing tools
☆10Updated last year
Alternatives and similar repositories for nmt_data_tools
Users who are interested in nmt_data_tools are comparing it to the repositories listed below
- Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency …☆10Updated 2 years ago
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆43Updated last year
- The repo of "Improving Seq2Seq Grammatical Error Correction via Decoding Interventions"☆32Updated last year
- ACL2023 (Oral): TemplateGEC: Improving Grammatical Error Correction with Detection Template☆22Updated 2 years ago
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆26Updated 3 years ago
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Updated 2 years ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆75Updated 2 years ago
- ☆54Updated 3 years ago
- 🩺 A collection of ChatGPT evaluation reports on various benchmarks.☆50Updated 2 years ago
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023)☆17Updated last year
- ☆35Updated 2 years ago
- Plug-and-Play Document Modules for Pre-trained Models☆26Updated 2 years ago
- Award-winning solution for the iFLYTEK Low-Resource Multilingual Text Translation Challenge☆27Updated 2 years ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆38Updated 2 years ago
- This is the code repo for the paper <UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction>☆15Updated 2 years ago
- ✒️ ChatGPT as a writing partner.☆14Updated 2 years ago
- NTK scaled version of ALiBi position encoding in Transformer.☆69Updated 2 years ago
- ☆14Updated 3 years ago
- Code & data for our EMNLP2022 paper "SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser"☆86Updated last year
- ☆33Updated last year
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"☆23Updated last year
- EMNLP2022 "Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment"☆19Updated 2 years ago
- A collection of instruction data and scripts for machine translation.☆20Updated 2 years ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Updated 2 years ago
- Code for paper "Nearest Neighbor Knowledge Distillation for Neural Machine Translation" by Zhixian Yang, Renliang Sun, and Xiaojun Wan. T…☆32Updated 3 years ago
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback☆40Updated 2 years ago
- ROUGE for multilingual Summarization☆25Updated 4 years ago
- [ACL 2022] Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation☆31Updated 2 years ago
- Source codes of ACL 2022-Efficient Cluster-based k-Nearest-Neighbor Machine Translation☆26Updated 3 years ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC Chinese grammatical error correction corpus and STG model☆120Updated last year