Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆62Aug 30, 2024Updated last year
Alternatives and similar repositories for CLAIR_and_APO
Users who are interested in CLAIR_and_APO are comparing it to the libraries listed below.
- ☆17Sep 1, 2024Updated last year
- ☆16Jul 23, 2024Updated last year
- Implementations of online merging optimizers proposed by Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment☆82Jun 19, 2024Updated last year
- Code and data for the paper "Measuring Conversational Uptake: A Case-Study on Student-Teacher Interactions"☆24Apr 24, 2025Updated 11 months ago
- Distributed multi-agent framework for event-driven, graph-based computation. Elixir/Python, NATS event streaming, modular operator/XCS ar…☆14Mar 25, 2026Updated 2 weeks ago
- ☆33Jan 6, 2025Updated last year
- Code of "Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment" (2025).☆14Apr 4, 2025Updated last year
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆11Jan 10, 2025Updated last year
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated last year
- ☆10Oct 24, 2024Updated last year
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆47Apr 15, 2025Updated last year
- ☆131Oct 1, 2024Updated last year
- [ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning☆54Jul 28, 2024Updated last year
- A recipe for online RLHF and online iterative DPO.☆543Dec 28, 2024Updated last year
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"☆43Oct 1, 2024Updated last year
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑☆15Apr 18, 2024Updated last year
- ☆55Jan 15, 2026Updated 3 months ago
- https://liuzeming01.github.io/XDailyDialog/☆14Jun 25, 2023Updated 2 years ago
- ☆13Jun 4, 2024Updated last year
- Official repository for ORPO☆475May 31, 2024Updated last year
- Benchmarking Benchmark Leakage in Large Language Models☆60May 20, 2024Updated last year
- A library for creating story-driven, clustered load-testing packages through a readable API.☆30May 20, 2010Updated 15 years ago
- Reactive DDD with DSPy☆23Feb 24, 2024Updated 2 years ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- Structured outputs from DSPy and Jinja2☆27Jun 27, 2025Updated 9 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 8 months ago
- SIFT: Grounding LLM Reasoning in Contexts via Stickers☆57Mar 6, 2025Updated last year
- A Structured Output Benchmark whose 'ground-truth' is actually right☆19Dec 5, 2025Updated 4 months ago
- ☆24Jan 28, 2025Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆257Oct 30, 2024Updated last year
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 6 months ago
- Training and Benchmarking LLMs for Code Preference.☆38Nov 15, 2024Updated last year
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Jun 5, 2023Updated 2 years ago
- RS-IMLE☆44Dec 7, 2024Updated last year
- Official repo of Respond-and-Respond: data, code, and evaluation☆104Aug 2, 2024Updated last year
- TS-LLaVA: Constructing Visual Tokens through Thumbnail-and-Sampling for Training-Free Video Large Language Models☆19Jan 2, 2025Updated last year
- Public Inflection Benchmarks☆67Mar 6, 2024Updated 2 years ago
- A deep research framework☆27Feb 3, 2026Updated 2 months ago