yinyueqin / relative-preference-optimization
Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts
☆16 Updated 6 months ago
Related projects:
- ☆23 Updated 2 months ago
- [ACL'2024] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization ☆44 Updated last month
- ☆31 Updated 8 months ago
- ☆25 Updated 11 months ago
- ☆40 Updated 5 months ago
- ☆73 Updated 8 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati… ☆16 Updated 2 months ago
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge. ☆47 Updated last month
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension. ☆51 Updated 3 months ago
- Official implementation for the paper *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving* ☆57 Updated 3 weeks ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision ☆78 Updated last week
- The source code of the EMNLP 2023 main conference paper: Sparse Low-rank Adaptation of Pre-trained Language Models. ☆62 Updated 6 months ago
- This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strat… ☆66 Updated 5 months ago
- FusionBench: A Comprehensive Benchmark of Deep Model Fusion ☆42 Updated 2 weeks ago
- ☆53 Updated 5 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024. ☆40 Updated 2 weeks ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning" ☆59 Updated 7 months ago
- ☆20 Updated last month
- ☆21 Updated 2 months ago
- A benchmark for evaluating the capabilities of large vision-language models (LVLMs) ☆32 Updated 10 months ago
- my commonly-used tools ☆46 Updated last month
- An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation ☆85 Updated 8 months ago
- [ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs" ☆61 Updated 9 months ago
- TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models ☆56 Updated 7 months ago
- ☆38 Updated 8 months ago
- ☆54 Updated 2 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards ☆33 Updated last month
- Source code for the TMLR paper "Black-Box Prompt Learning for Pre-trained Language Models" ☆55 Updated last year
- [Arxiv] Calibrated Self-Rewarding Vision Language Models ☆35 Updated 3 months ago
- ☆20 Updated 4 months ago