jiaohuix / nmt_data_toolsLinks
machine translation data process tools
☆10Updated last year
Alternatives and similar repositories for nmt_data_tools
Users that are interested in nmt_data_tools are comparing it to the libraries listed below
Sorting:
- Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency …☆10Updated last year
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆26Updated 2 years ago
- 科大讯飞低资源多语种文本翻译挑战赛获奖方案☆28Updated last year
- ☆32Updated 2 years ago
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Updated last year
- ☆33Updated last year
- ACL2023 (Oral): TemplateGEC: Improving Grammatical Error Correction with Detection Template☆22Updated last year
- Code of the COLING22 paper "uChecker: Masked Pretrained Language Models as Unsupervised Chinese Spelling Checkers"☆19Updated 2 years ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Updated last year
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆44Updated last year
- A collection of instruction data and scripts for machine translation.☆20Updated last year
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))☆13Updated 3 years ago
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023)☆17Updated last year
- The repo of "Improving Seq2Seq Grammatical Error Correction via Decoding Interventions"☆29Updated last year
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆49Updated 2 years ago
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆39Updated last month
- This is the code repo for the paper <UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction>☆15Updated last year
- Implementation of latent-GLAT (ACL-2022)☆33Updated 3 years ago
- Code for paper "Nearest Neighbor Knowledge Distillation for Neural Machine Translation" by Zhixian Yang, Renliang Sun, and Xiaojun Wan. T…☆31Updated 2 years ago
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆39Updated 11 months ago
- Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions"☆12Updated last year
- ROUGE for multilingual Summarization☆25Updated 3 years ago
- Code for embedding and retrieval research.☆16Updated last year
- ☆37Updated last year
- code repo for EMNLP'21 Finding Counter-Interference Adapter for Multilingual Machine Translation☆18Updated 2 years ago
- ☆53Updated 3 years ago
- LLM4MT☆9Updated last year
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback☆39Updated last year
- ☆14Updated 2 years ago
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"☆22Updated 11 months ago