jiaohuix / nmt_data_toolsLinks
machine translation data process tools
☆10Updated last year
Alternatives and similar repositories for nmt_data_tools
Users that are interested in nmt_data_tools are comparing it to the libraries listed below
Sorting:
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆44Updated last year
- Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency …☆10Updated 2 years ago
- ☆53Updated 3 years ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆75Updated 2 years ago
- Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆41Updated 3 weeks ago
- ☆40Updated last year
- The repo of "Improving Seq2Seq Grammatical Error Correction via Decoding Interventions"☆29Updated last year
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆49Updated 2 years ago
- ☆34Updated 2 years ago
- A collection of instruction data and scripts for machine translation.☆20Updated last year
- Plug-and-Play Document Modules for Pre-trained Models☆26Updated 2 years ago
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆26Updated 2 years ago
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Updated 2 years ago
- ☆59Updated last year
- 科大讯飞低资源多语种文本翻译挑战赛获奖方案☆29Updated last year
- ACL2023 (Oral): TemplateGEC: Improving Grammatical Error Correction with Detection Template☆22Updated 2 years ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆58Updated last year
- This is the code repo for the paper <UTC-IE: A Unified Token-pair Classification Architecture for Information Extraction>☆15Updated last year
- Open source code and data for AAAI 2022 Oral Paper "Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding"☆35Updated last year
- ROUGE for multilingual Summarization☆25Updated 3 years ago
- code for Teaching LM to Translate with Comparison☆39Updated last year
- ACL Paper Lists(machine translation)☆13Updated 3 years ago
- Code for "Improving Translation Faithfulness of Large Language Models via Augmenting Instructions"☆12Updated last year
- Learning to Rewrite for Non-Autoregressive Neural Machine Translation☆21Updated 3 years ago
- The official implementation of ACL2022``Bottom-Up Constituency Parsing and Nested Named Entity Recognition with Pointer Networks''☆33Updated 2 years ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆81Updated last year
- This is the third version of the practices for the rookies of BJTUNLPers.☆9Updated 3 years ago
- The official repo for the paper "Teacher Forcing Recovers Reward Functions for Text Generation"☆31Updated 2 years ago
- Code & data for our EMNLP2022 paper "SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser"☆83Updated last year
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023)☆17Updated last year