PKU-Baichuan-MLSystemLab / SysBench
SysBench: Can Large Language Models Follow System Messages?
☆20 Updated 2 months ago
Related projects
Alternatives and complementary repositories for SysBench
- Code for our EMNLP-2023 paper: "Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks" ☆23 Updated 11 months ago
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?" (AAAI 2024) ☆44 Updated 6 months ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning ☆35 Updated last year
- Towards Systematic Measurement for Long Text Quality ☆28 Updated 2 months ago
- JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning ☆10 Updated last week
- CFBench: A Comprehensive Constraints-Following Benchmark for LLMs ☆24 Updated 2 months ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study ☆42 Updated last year
- Code and data for "MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models" ☆30 Updated 3 weeks ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L… ☆37 Updated 4 months ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation" ☆77 Updated last year
- Code for "FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models (ACL 2024)" ☆86 Updated last week
- Merging Generated and Retrieved Knowledge for Open-Domain QA (EMNLP 2023) ☆22 Updated last year
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors ☆34 Updated 5 months ago
- Code and data for paper "Context-faithful Prompting for Large Language Models". ☆39 Updated last year
- self-adaptive in-context learning ☆41 Updated last year
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models ☆40 Updated 10 months ago
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context" ☆61 Updated 3 months ago
- This is the repository for paper "CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models" ☆22 Updated last year
- Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process ☆22 Updated 3 months ago
- Collection of papers for scalable automated alignment. ☆71 Updated 2 weeks ago
- EMNLP'2024: Knowledge Verification to Nip Hallucination in the Bud ☆19 Updated 8 months ago
- Implementation of "ACL'24: When Do LLMs Need Retrieval Augmentation? Mitigating LLMs’ Overconfidence Helps Retrieval Augmentation" ☆19 Updated 3 months ago