thunlp / SubCharTokenizationLinks
☆44Updated 2 years ago
Alternatives and similar repositories for SubCharTokenization
Users that are interested in SubCharTokenization are comparing it to the libraries listed below
Sorting:
- ☆32Updated 2 years ago
- ☆41Updated 2 years ago
- The repo of "Improving Seq2Seq Grammatical Error Correction via Decoding Interventions"☆29Updated last year
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆50Updated 2 years ago
- reStructured Pre-training☆98Updated 2 years ago
- The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1…☆177Updated 7 months ago
- ⚡Research papers about leveraging the capabilities of language models⚡☆52Updated 2 years ago
- ☆34Updated 2 years ago
- Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"☆22Updated last year
- Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021☆60Updated 4 years ago
- Tools for formatting WMT hypothesis and test sets in XML☆27Updated 4 months ago
- Code for the paper Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning☆100Updated 2 years ago
- SummScreen: A Dataset for Abstractive Screenplay Summarization (ACL 2022)☆37Updated 3 years ago
- ☆20Updated last year
- ACL Paper Lists(machine translation)☆13Updated 3 years ago
- Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric …☆79Updated last year
- The code of paper "Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation" published at NeurIPS 202…☆47Updated 2 years ago
- Source codes and datasets for How well do Large Language Models perform in Arithmetic tasks?☆57Updated 2 years ago
- Code & data for our EMNLP2022 paper "SynGEC: Syntax-Enhanced Grammatical Error Correction with a Tailored GEC-Oriented Parser"☆84Updated last year
- ROUGE for multilingual Summarization☆25Updated 3 years ago
- ⏳ ChatLog: Recording and Analysing ChatGPT Across Time☆101Updated last year
- Code for ACL2023 paper: Pre-Training to Learn in Context☆107Updated last year
- Logiqa2.0 dataset - logical reasoning in MRC and NLI tasks☆99Updated 2 years ago
- A extension of Transformers library to include T5ForSequenceClassification class.☆39Updated 2 years ago
- The unified platform for data-related resources.☆134Updated 2 years ago
- On Transferability of Prompt Tuning for Natural Language Processing☆99Updated last year
- GIFT (ACL 2023) & MPC-BERT (ACL 2021) for Multi-Party Conversation Understanding☆41Updated 2 years ago
- [ACL 2022] Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation☆31Updated last year
- Code for AAAI 2021 paper "Lexically Constrained Neural Machine Translation with Explicit Alignment Guidance"☆25Updated 2 years ago
- PETCI: A Parallel English Translation Dataset of Chinese Idioms☆25Updated 3 years ago