byronBBL / Context-DPO
Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness"
☆14Updated last month
Alternatives and similar repositories for Context-DPO:
Users that are interested in Context-DPO are comparing it to the libraries listed below
- Evaluate the Quality of Critique☆35Updated 7 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆44Updated last month
- BeHonest: Benchmarking Honesty in Large Language Models☆31Updated 5 months ago
- This repository provides the data and the codes used in the AAAI'24 paper, COOPER: Coordinating Specialized Agents towards a Complex Dial…☆23Updated 10 months ago
- ☆72Updated 8 months ago
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆13Updated last month
- ☆40Updated last year
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆34Updated last year
- Source code of our EMNLP 2024 paper "FactAlign: Long-form Factuality Alignment of Large Language Models"☆14Updated 3 months ago
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆68Updated last month
- [NAACL'25] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆41Updated 2 months ago
- A method of ensemble learning for heterogeneous large language models.☆33Updated 5 months ago
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models☆18Updated 4 months ago
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆44Updated last month
- L-CITEEVAL: DO LONG-CONTEXT MODELS TRULY LEVERAGE CONTEXT FOR RESPONDING?☆22Updated 3 months ago
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"☆20Updated 3 months ago
- ☆33Updated 10 months ago
- This is the code repo for our paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards".☆23Updated last month
- The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agen…☆23Updated 10 months ago
- GPT as Human☆18Updated last month
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context"☆65Updated 5 months ago
- Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)☆25Updated last year
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering☆55Updated last month
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆39Updated last year
- Towards Systematic Measurement for Long Text Quality☆31Updated 4 months ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆53Updated 9 months ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆62Updated 11 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆43Updated 7 months ago
- AbstainQA, ACL 2024☆25Updated 3 months ago
- ☆16Updated 2 months ago