thunlp / SubCharTokenizationLinks
☆44Updated 2 years ago
Alternatives and similar repositories for SubCharTokenization
Users that are interested in SubCharTokenization are comparing it to the libraries listed below
Sorting:
- ☆40Updated last year
- ☆31Updated 2 years ago
- The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1…☆176Updated 6 months ago
- This repository contains the code to train flan t5 with alpaca instructions and low rank adaptation.☆51Updated 2 years ago
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆49Updated 2 years ago
- Code and data associated with the AmbiEnt dataset in "We're Afraid Language Models Aren't Modeling Ambiguity" (Liu et al., 2023)☆64Updated last year
- ☆40Updated 2 years ago
- ☆19Updated last year
- reStructured Pre-training☆98Updated 2 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆103Updated last year
- ☆34Updated 2 years ago
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions☆172Updated 2 years ago
- Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric …☆77Updated last year
- Code for the paper Code for the paper InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning☆100Updated 2 years ago
- A Multilingual Replicable Instruction-Following Model☆94Updated 2 years ago
- [COLING22] An End-to-End Library for Evaluating Natural Language Generation☆92Updated last year
- A library for building hierarchical text representation and corresponding downstream applications.☆79Updated 11 months ago
- Source codes and datasets for How well do Large Language Models perform in Arithmetic tasks?☆56Updated 2 years ago
- MultilingualShareGPT, the free multi-language corpus for LLM training☆72Updated 2 years ago
- ☆71Updated 3 years ago
- OPD: Chinese Open-Domain Pre-trained Dialogue Model☆75Updated 2 years ago
- Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021☆61Updated 4 years ago
- Transitioning from Open-Domain Chit-Chat to Task-Oriented Dialogues☆43Updated 3 years ago
- Code for our ACL2021 paper Neural Machine Translation with Monolingual Translation Memory☆82Updated 2 years ago
- ☆134Updated 4 years ago
- Chinese AMR Corpus☆37Updated 3 months ago
- First explanation metric (diagnostic report) for text generation evaluation☆62Updated 4 months ago
- Code for ACL2023 paper: Pre-Training to Learn in Context☆107Updated 11 months ago
- This project maintains a reading list for general text generation tasks☆65Updated 3 years ago
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆55Updated last year