vicgalle / refined-dpo
Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs
☆13 Updated last year
Alternatives and similar repositories for refined-dpo:
Users who are interested in refined-dpo are comparing it to the repositories listed below.
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 Updated last year
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024) ☆44 Updated 2 months ago
- ☆23 Updated 2 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs ☆52 Updated last year
- ☆68 Updated last year
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses". ☆27 Updated 7 months ago
- Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models" ☆27 Updated last month
- [ACL'24] Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements ☆22 Updated 6 months ago
- ☆27 Updated last week
- Scalable Meta-Evaluation of LLMs as Evaluators ☆42 Updated last year
- Official project for the paper "Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers" ☆30 Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆30 Updated 2 weeks ago
- SCREWS: A Modular Framework for Reasoning with Revisions ☆27 Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆42 Updated last year
- ☆40 Updated last month
- Tasks for describing differences between text distributions. ☆16 Updated 7 months ago
- Codebase for Instruction Following without Instruction Tuning ☆33 Updated 6 months ago
- ☆22 Updated 3 months ago
- ☆13 Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages ☆44 Updated 3 months ago
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea… ☆73 Updated last year
- ☆27 Updated last year
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆33 Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆55 Updated 7 months ago
- The repository contains code for Adaptive Data Optimization ☆20 Updated 3 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation ☆28 Updated last month
- Official Repository of Are Your LLMs Capable of Stable Reasoning? ☆23 Updated 2 weeks ago
- ☆43 Updated 9 months ago
- ☆38 Updated 5 months ago
- Knowledge Unlearning for Large Language Models ☆22 Updated 3 weeks ago