YJiangcm / BMCLinks
[ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization
☆12Updated 11 months ago
Alternatives and similar repositories for BMC
Users that are interested in BMC are comparing it to the libraries listed below
Sorting:
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated last year
- ☆16Updated last year
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆22Updated last month
- ☆26Updated 2 months ago
- [ACL 2025 (Findings)] DEMO: Reframing Dialogue Interaction with Fine-grained Element Modeling☆20Updated last year
- Official repository for Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning☆12Updated last year
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Updated last year
- ☆46Updated 3 months ago
- ☆14Updated 11 months ago
- ☆20Updated 9 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Updated last year
- ☆15Updated last year
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆24Updated 2 months ago
- ☆14Updated 2 years ago
- ☆23Updated last year
- The paper list of multilingual pre-trained models (Continual Updated).☆24Updated last year
- Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)☆28Updated 2 years ago
- ☆16Updated last year
- [EMNLP 2025] Verification Engineering for RL in Instruction Following☆46Updated 3 months ago
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆44Updated 4 months ago
- ☆30Updated last year
- Evaluating the faithfulness of long-context language models☆30Updated last year
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆55Updated 3 weeks ago
- ☆22Updated 5 months ago
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆30Updated last year
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆39Updated last year
- Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling'☆27Updated 7 months ago
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆27Updated 3 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Updated 2 months ago
- The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"☆18Updated 11 months ago