yinyueqin / relative-preference-optimization

Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts
16Updated 6 months ago

Related projects: