vicgalle / refined-dpoLinks
Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs
☆13Updated last year
Alternatives and similar repositories for refined-dpo
Users that are interested in refined-dpo are comparing it to the libraries listed below
Sorting:
- ☆22Updated 6 months ago
- ☆72Updated last year
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆54Updated last year
- Codebase for Instruction Following without Instruction Tuning☆34Updated 9 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆25Updated 3 months ago
- ☆24Updated 9 months ago
- [ACL'24] Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements☆23Updated 9 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- The repository contains code for Adaptive Data Optimization☆25Updated 6 months ago
- Measuring and Controlling Persona Drift in Language Model Dialogs☆17Updated last year
- ☆27Updated 2 years ago
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea…☆74Updated last year
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)☆47Updated 5 months ago
- Aioli: A unified optimization framework for language model data mixing☆27Updated 5 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Updated last year
- A testbed for agents and environments that can automatically improve models through data generation.☆24Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆34Updated last year
- List of papers on Self-Correction of LLMs.☆73Updated 6 months ago
- ☆24Updated 5 months ago
- ☆27Updated this week
- Evaluate the Quality of Critique☆35Updated last year
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆20Updated last year
- ☆25Updated last year
- This is official project in our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers☆30Updated last year
- ☆28Updated last year
- Code/data for MARG (multi-agent review generation)☆44Updated 7 months ago
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆37Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated this week