ContextualAI/CLAIR_and_APO

Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment

☆62

Alternatives and similar repositories for CLAIR_and_APO

Users who are interested in CLAIR_and_APO are comparing it to the libraries listed below.

dkubeai / langrunner
☆17 · Sep 1, 2024 · Updated last year
yale-nlp / refdpo
☆16 · Jul 23, 2024 · Updated 2 years ago
jmanhype / ace-adaptive-code-evolution
ACE (Adaptive Code Evolution) is an AI-powered system for code analysis and optimization.
☆12 · Mar 25, 2026 · Updated 4 months ago
QwenLM / online_merging_optimizers
Implementations of online merging optimizers proposed by Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment
☆82 · Jun 19, 2024 · Updated 2 years ago
jmanhype / hypergraph_agents_umbrella
Distributed multi-agent framework for event-driven, graph-based computation. Elixir/Python, NATS event streaming, modular operator/XCS ar…
☆14 · Mar 25, 2026 · Updated 4 months ago
ddemszky / conversational-uptake
Code and data for the paper "Measuring Conversational Uptake: A Case-Study on Student-Teacher Interactions"
☆25 · Apr 24, 2025 · Updated last year
CyberAgentAILab / regularized-bon
Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).
☆14 · Apr 4, 2025 · Updated last year
UCSB-NLP-Chang / Prereq_tune
Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"
☆11 · Jan 10, 2025 · Updated last year
janphilippfranken / sami
Self-Supervised Alignment with Mutual Information
☆20 · May 24, 2024 · Updated 2 years ago
fsndzomga / open_source_lrm
☆10 · Oct 24, 2024 · Updated last year
jmanhype / DSPy-Multi-Document-Agents
An advanced distributed knowledge fabric for intelligent document processing, featuring multi-document agents, optimized query handling, …
☆52 · Mar 25, 2026 · Updated 4 months ago
sail-sg / dice
Official implementation of Bootstrapping Language Models via DPO Implicit Rewards
☆47 · Apr 15, 2025 · Updated last year
Asap7772 / understanding-rlhf
Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…
☆32 · Apr 20, 2024 · Updated 2 years ago
swarnaHub / System-1.x
PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models
☆25 · Jul 22, 2024 · Updated 2 years ago
eric-tramel / zoty
Lightweight Zotero MCP server for AI agents
☆15 · May 16, 2026 · Updated 2 months ago
SALT-NLP / demonstrated-feedback
☆131 · Oct 1, 2024 · Updated last year
TianduoWang / DPO-ST
[ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning
☆54 · Jul 28, 2024 · Updated last year
limenlp / safer-instruct
This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17 · Feb 22, 2024 · Updated 2 years ago
RLHFlow / Online-RLHF
A recipe for online RLHF and online iterative DPO.
☆544 · Dec 28, 2024 · Updated last year
angie-chen55 / pref-learning-ranking-acc
☆13 · Jun 4, 2024 · Updated 2 years ago
padas-lab-de / ir-rag-sigir24-persona-rag
☆55 · Jun 23, 2026 · Updated last month
GAIR-NLP / benbench
Benchmarking Benchmark Leakage in Large Language Models
☆61 · May 20, 2024 · Updated 2 years ago
seanchatmangpt / dslmodel
Structured outputs from DSPy and Jinja2
☆27 · Jun 27, 2025 · Updated last year
xfactlab / orpo
Official repository for ORPO
☆480 · May 31, 2024 · Updated 2 years ago
J-Seo / KoCommonGEN-V2
KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models
☆25 · Aug 24, 2024 · Updated last year
l4b4r4b4b4 / AIDocks
LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT
☆27 · Feb 18, 2024 · Updated 2 years ago
SJTU-DENG-Lab / SIFT
SIFT: Grounding LLM Reasoning in Contexts via Stickers
☆57 · Mar 6, 2025 · Updated last year
evanatyourservice / llm-jax
Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆19 · Jul 24, 2025 · Updated last year
amazon-science / llm-code-preference
Training and Benchmarking LLMs for Code Preference.
☆38 · Nov 15, 2024 · Updated last year
arcee-ai / EvolKit
EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…
☆258 · Oct 30, 2024 · Updated last year
iliazlobin / dspy-research
Experiments with DSPy: declarative, trainable LLM pipelines. Notebooks and demos for evaluating, composing, and optimizing LLM workflows…
☆10 · Jun 24, 2026 · Updated last month
feradauto / MoralCoT
Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment
☆40 · Jun 5, 2023 · Updated 3 years ago
uclaml / Rephrase-and-Respond
Official repo of Rephrase-and-Respond: data, code, and evaluation
☆104 · Aug 2, 2024 · Updated last year
SerChirag / rs-imle
RS-IMLE
☆44 · Dec 7, 2024 · Updated last year
adamkarvonen / SAE_BoardGameEval
☆25 · Jan 28, 2025 · Updated last year
NVlabs / STL
Official PyTorch Implementation of Self-emerging Token Labeling
☆35 · Mar 27, 2024 · Updated 2 years ago
tingyu215 / TS-LLaVA
TS-LLaVA: Constructing Visual Tokens through Thumbnail-and-Sampling for Training-Free Video Large Language Models
☆17 · Jan 2, 2025 · Updated last year
technion-cs-nlp / hallucination-mitigation
☆23 · Dec 17, 2024 · Updated last year
InflectionAI / Inflection-Benchmarks
Public Inflection Benchmarks
☆67 · Mar 6, 2024 · Updated 2 years ago