Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆61Aug 30, 2024Updated last year
Alternatives and similar repositories for CLAIR_and_APO
Users who are interested in CLAIR_and_APO are comparing it to the libraries listed below
- ☆33Jan 6, 2025Updated last year
- ☆17Sep 1, 2024Updated last year
- ☆16Jul 23, 2024Updated last year
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 3 months ago
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"☆43Oct 1, 2024Updated last year
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated last year
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024)☆66Nov 8, 2024Updated last year
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models☆25Jul 22, 2024Updated last year
- ☆130Oct 1, 2024Updated last year
- ACE (Adaptive Code Evolution) is an AI-powered system for code analysis and optimization.☆12Nov 4, 2025Updated 4 months ago
- Implementations of the online merging optimizers proposed in "Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment"☆81Jun 19, 2024Updated last year
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- Zero-setup bash CLI that downloads full-resolution images from iCloud/Dropbox/Google Photos share links, bridging iPhone screenshots to r…☆33Feb 22, 2026Updated last week
- ☆54Jan 15, 2026Updated last month
- ☆10Oct 24, 2024Updated last year
- Code for the paper: Comparing and Contrasting Deep Learning Weather Prediction Backbones on Navier-Stokes and Atmospheric Dynamics☆14Aug 9, 2024Updated last year
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆57Mar 6, 2025Updated 11 months ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- https://liuzeming01.github.io/XDailyDialog/☆13Jun 25, 2023Updated 2 years ago
- [ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning☆53Jul 28, 2024Updated last year
- Official repository for ORPO☆471May 31, 2024Updated last year
- A recipe for online RLHF and online iterative DPO.☆540Dec 28, 2024Updated last year
- ☆14Mar 28, 2024Updated last year
- ☆36Feb 23, 2026Updated last week
- Python library to add support for embedding natural code in Python with shared program state.☆23Jan 20, 2026Updated last month
- ☆13Jun 4, 2024Updated last year
- Chunk Dedupe Estimation☆20Nov 5, 2024Updated last year
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆46Sep 19, 2025Updated 5 months ago
- Benchmarking Benchmark Leakage in Large Language Models☆60May 20, 2024Updated last year
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆64Feb 19, 2026Updated 2 weeks ago
- Training and Benchmarking LLMs for Code Preference.☆38Nov 15, 2024Updated last year
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Apr 20, 2024Updated last year
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑☆15Apr 18, 2024Updated last year
- Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).☆14Apr 4, 2025Updated 11 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Jun 10, 2024Updated last year
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Aug 2, 2024Updated last year
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆39Jan 12, 2024Updated 2 years ago
- Public Inflection Benchmarks☆68Mar 6, 2024Updated last year