vicgalle / refined-dpo

Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs

☆13

Alternatives and similar repositories for refined-dpo

Users who are interested in refined-dpo are comparing it to the libraries listed below.

technion-cs-nlp / hallucination-mitigation
☆22 Updated 6 months ago
abhika-m / FAVA
☆72 Updated last year
casmlab / NPHardEval
Repository for NPHardEval, a quantified-dynamic benchmark of LLMs
☆54 Updated last year
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆34 Updated 9 months ago
limenlp / safer-instruct
This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17 Updated last year
open-compass / GPassK
[ACL 2025] Are Your LLMs Capable of Stable Reasoning?
☆25 Updated 3 months ago
dinobby / MAgICoRE
☆24 Updated 9 months ago
tianyi-lab / DEBATunE
[ACL'24] Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements
☆23 Updated 9 months ago
scottlogic-alex / prm800k-denorm
Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format
☆27 Updated last year
yidingjiang / ado
The repository contains code for Adaptive Data Optimization
☆25 Updated 6 months ago
likenneth / persona_drift
Measuring and Controlling Persona Drift in Language Model Dialogs
☆17 Updated last year
ctlllll / reward_collapse
☆27 Updated 2 years ago
ShiZhengyan / PowerfulPromptFT
[NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea…
☆74 Updated last year
cambridgeltl / PairS
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)
☆47 Updated 5 months ago
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆27 Updated 5 months ago
xhan77 / in-context-alignment
In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning
☆35 Updated last year
codezakh / DataEnvGym
A testbed for agents and environments that can automatically improve models through data generation.
☆24 Updated 3 months ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆57 Updated 9 months ago
dinobby / MAGDi
The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…
☆34 Updated last year
ryokamoi / llm-self-correction-papers
List of papers on Self-Correction of LLMs.
☆73 Updated 6 months ago
SalesforceAIResearch / FoFo
☆24 Updated 5 months ago
austrian-code-wizard / c3po
☆27 Updated this week
GAIR-NLP / MetaCritique
Evaluate the Quality of Critique
☆35 Updated last year
kyegomez / EAOT
The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"
☆20 Updated last year
scandukuri / assistant-gate
☆25 Updated last year
nuochenpku / LLaMA_Analysis
This is the official project for our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers
☆30 Updated last year
PrasannS / rlhf-length-biases
☆28 Updated last year
allenai / marg-reviewer
Code/data for MARG (multi-agent review generation)
☆44 Updated 7 months ago
csinva / tree-prompt
Tree prompting: easy-to-use scikit-learn interface for improved prompting.
☆37 Updated last year
kyegomez / Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆30 Updated this week