NJUNLP / x-LLM
☆27Updated last year
Related projects ⓘ
Alternatives and complementary repositories for x-LLM
- Unofficial implementation of AlpaGasus☆84Updated last year
- code for Teaching LM to Translate with Comparison☆37Updated 11 months ago
- ☆47Updated 3 months ago
- [ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation"☆31Updated 4 months ago
- First explanation metric (diagnostic report) for text generation evaluation☆61Updated 4 months ago
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆48Updated last year
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆56Updated 8 months ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆86Updated last year
- Code and Data Repo for ACL'23 Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"☆53Updated 10 months ago
- Source codes and datasets for How well do Large Language Models perform in Arithmetic tasks?☆57Updated last year
- ☆89Updated last month
- [ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues☆51Updated 3 months ago
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models☆22Updated 3 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆47Updated 4 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆68Updated 5 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆83Updated 4 months ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆50Updated 7 months ago
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆34Updated last month
- ☆29Updated last year
- EMNLP2022 "Cross-Align: Modeling Deep Cross-lingual Interactions for Word Alignment"☆17Updated last year
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆60Updated 8 months ago
- ☆59Updated last year
- Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"☆64Updated last week
- ☆27Updated 10 months ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆42Updated last year
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆30Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆126Updated 2 months ago
- GPT as Human☆18Updated 10 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆70Updated 9 months ago