NJUNLP / x-LLM
☆25Updated 11 months ago
Related projects: ⓘ
- ☆44Updated 3 weeks ago
- [EMNLP 2023] ALCUNA: Large Language Models Meet New Knowledge☆22Updated 10 months ago
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models☆22Updated last month
- Code for Suri: Multi-constraint instruction following for long-form text generation☆15Updated last week
- [ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues☆38Updated last month
- [ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation"☆29Updated 2 months ago
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆49Updated last year
- ☆16Updated 6 months ago
- Unofficial implementation of AlpaGasus☆83Updated 11 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆54Updated 6 months ago
- ☆57Updated last year
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆62Updated 3 months ago
- ☆26Updated last year
- code for Teaching LM to Translate with Comparison☆38Updated 9 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆34Updated 2 months ago
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆33Updated 2 months ago
- First explanation metric (diagnostic report) for text generation evaluation☆59Updated 2 months ago
- ☆26Updated 8 months ago
- Source codes and datasets for How well do Large Language Models perform in Arithmetic tasks?☆57Updated last year
- A collection of instruction data and scripts for machine translation.☆20Updated 11 months ago
- TBC☆26Updated last year
- ☆39Updated 9 months ago
- Code base of In-Context Learning for Dialogue State tracking☆43Updated 11 months ago
- Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process☆17Updated last month
- Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023)☆62Updated 9 months ago
- GPT as Human☆17Updated 8 months ago
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆28Updated 3 months ago
- The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene…☆25Updated 10 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆62Updated 7 months ago
- Contrastive Chain-of-Thought Prompting☆50Updated 10 months ago