tianyi-lab / DEBATunE
[ACL'24] Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements
☆20Updated 4 months ago
Alternatives and similar repositories for DEBATunE:
Users that are interested in DEBATunE are comparing it to the libraries listed below
- [EMNLP'24 (Main)] DRPO(Dynamic Rewarding with Prompt Optimization) is a tuning-free approach for self-alignment. DRPO leverages a search-…☆18Updated 2 months ago
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆39Updated last year
- ☆64Updated 11 months ago
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆31Updated 7 months ago
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding☆62Updated 2 years ago
- AbstainQA, ACL 2024☆25Updated 3 months ago
- We have released the code and demo program required for LLM with self-verification☆53Updated last year
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆58Updated last year
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆28Updated 9 months ago
- Personality Alignment of Language Models☆19Updated 4 months ago
- ☆45Updated last year
- Evaluate the Quality of Critique☆35Updated 7 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆55Updated 6 months ago
- Supporting code for ReCEval paper☆27Updated 4 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Updated 10 months ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆31Updated 8 months ago
- ☆84Updated 2 years ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆71Updated 7 months ago
- ☆18Updated 6 months ago
- Contrastive Chain-of-Thought Prompting☆57Updated last year
- A framework for evolving and testing question-answering datasets with various models.☆13Updated 10 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated 11 months ago
- awesome-LLM-controlled-constrained-generation☆33Updated 5 months ago
- Lightweight tool to identify Data Contamination in LLMs evaluation☆45Updated 10 months ago
- ☆20Updated this week
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆20Updated 10 months ago
- DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems☆18Updated 3 months ago
- ☆44Updated 4 months ago
- [ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation"☆33Updated 6 months ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆53Updated 9 months ago