NJUNLP / x-LLM
☆27Updated last year
Alternatives and similar repositories for x-LLM:
Users that are interested in x-LLM are comparing it to the libraries listed below
- First explanation metric (diagnostic report) for text generation evaluation☆63Updated 6 months ago
- ☆57Updated last month
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models☆22Updated 6 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆71Updated 7 months ago
- ☆52Updated 5 months ago
- code for Teaching LM to Translate with Comparison☆38Updated last year
- ☆29Updated last year
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Updated 11 months ago
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆22Updated 2 months ago
- [ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation"☆34Updated 6 months ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆89Updated last year
- Code for "FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models (ACL 2024)"☆94Updated last month
- [ACL'24] WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations☆12Updated 4 months ago
- Code and data for "Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation" (EMNLP 2023…☆30Updated 9 months ago
- Code and Data Repo for [ACL 2023] Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"☆53Updated last year
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆48Updated last year
- Unofficial implementation of AlpaGasus☆90Updated last year
- [ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues☆66Updated 6 months ago
- GPT as Human☆18Updated last month
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆39Updated last year
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆66Updated 2 weeks ago
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆36Updated 4 months ago
- Towards Systematic Measurement for Long Text Quality☆31Updated 4 months ago
- The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene…☆25Updated last year
- ☆60Updated 2 years ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆43Updated 7 months ago
- Source codes and datasets for How well do Large Language Models perform in Arithmetic tasks?☆56Updated last year
- ☆12Updated 2 years ago
- EMNLP2022 "Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment"☆17Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆42Updated last month