thunlp / SubCharTokenization
☆44Updated 2 years ago
Alternatives and similar repositories for SubCharTokenization
Users who are interested in SubCharTokenization are comparing it to the libraries listed below.
- 🩺 A collection of ChatGPT evaluation reports on various benchmarks.☆50Updated 2 years ago
- ☆32Updated 2 years ago
- Hierarchical Sketch Induction for Paraphrase Generation (Hosking et al., ACL 2022)☆51Updated last year
- ☆41Updated 2 years ago
- The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1…☆177Updated 9 months ago
- ☆35Updated 2 years ago
- ☆40Updated 3 years ago
- ACL Paper Lists(machine translation)☆13Updated 3 years ago
- reStructured Pre-training☆98Updated 2 years ago
- The repo of "Improving Seq2Seq Grammatical Error Correction via Decoding Interventions"☆29Updated last year
- Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021☆61Updated 4 years ago
- Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric …☆79Updated 2 years ago
- Transitioning from Open-Domain Chit-Chat to Task-Oriented Dialogues☆43Updated 3 years ago
- Code for the ACL 2022 paper "Contextual Representation Learning beyond Masked Language Modeling"☆33Updated 2 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆104Updated last year
- Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory☆82Updated 2 years ago
- First explanation metric (diagnostic report) for text generation evaluation☆62Updated 7 months ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆75Updated 2 years ago
- The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene…☆27Updated last year
- Tools for formatting WMT hypothesis and test sets in XML☆27Updated 5 months ago
- The first Chinese dataset for safety detection in psychological counseling dialogues☆21Updated last year
- PETCI: A Parallel English Translation Dataset of Chinese Idioms☆26Updated 3 years ago
- Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning☆100Updated 2 years ago
- On Transferability of Prompt Tuning for Natural Language Processing☆100Updated last year
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions☆173Updated 2 years ago
- Source code and datasets for "How well do Large Language Models perform in Arithmetic tasks?"☆56Updated 2 years ago
- ☆138Updated 4 years ago
- Code for ACL2023 paper: Pre-Training to Learn in Context☆107Updated last year
- Code for "Teaching LM to Translate with Comparison"☆39Updated last year
- Code and data associated with the AmbiEnt dataset in "We're Afraid Language Models Aren't Modeling Ambiguity" (Liu et al., 2023)☆64Updated last year