Linear95/APO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Linear95/APO)

Linear95 / APO

Code for ACL2024 paper - Adversarial Preference Optimization (APO).

☆54

Alternatives and similar repositories for APO

Users that are interested in APO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Linear95 / DSP
View on GitHub
Domain-specific preference (DSP) data and customized RM fine-tuning.
☆25Mar 7, 2024Updated 2 years ago
ctlllll / reward_collapse
View on GitHub
☆26May 30, 2023Updated 3 years ago
Linear95 / SPAG
View on GitHub
Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024
☆145Feb 24, 2025Updated last year
Yuanhy1997 / HyPe
View on GitHub
HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]
☆14Jul 11, 2023Updated 3 years ago
McGill-NLP / feedbackqa
View on GitHub
FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback
☆12Jul 13, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
KodCode-AI / code-r1
View on GitHub
Reproducing R1 for Code with Reliable Rewards
☆13Apr 9, 2025Updated last year
alchemistyzz / PeRL
View on GitHub
[NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning"
☆30Mar 30, 2026Updated 3 months ago
Spico197 / writing-comrade
View on GitHub
✒️ ChatGPT as a writing partner.
☆14Mar 6, 2023Updated 3 years ago
whyNLP / Conic10K
View on GitHub
Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.
☆33Dec 6, 2023Updated 2 years ago
yangzhch6 / DARS
View on GitHub
The official implemention of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration" (ICML 2026)
☆24Feb 4, 2026Updated 5 months ago
icip-cas / awesome-auto-alignment
View on GitHub
Collection of papers for scalable automated alignment.
☆92Oct 22, 2024Updated last year
2003pro / ScaleBiO
View on GitHub
This is the official implementation of ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting
☆25Jul 30, 2024Updated last year
lsvih / MWA
View on GitHub
Example code for "Enhancing Pre-trained Chinese Character Representation with Word-aligned Attention", ACL2020
☆17May 8, 2022Updated 4 years ago
MasterVito / SwS
View on GitHub
Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
☆41Nov 11, 2025Updated 8 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
GanjinZero / BioBART
View on GitHub
BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model [ACL-BioNLP 2022]
☆52Oct 26, 2022Updated 3 years ago
shadowkiller33 / Contrast-Instruction
View on GitHub
☆19Oct 2, 2023Updated 2 years ago
Yuanhy1997 / Auto-Diagnosis-by-RL-and-Classification
View on GitHub
Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and Classification [AI in Medicine Journal]
☆14May 20, 2022Updated 4 years ago
hkust-nlp / B-STaR
View on GitHub
B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
☆86May 21, 2025Updated last year
jason9693 / ETA4LLMs
View on GitHub
Calculating Expected Time for training LLM.
☆39Apr 17, 2023Updated 3 years ago
thu-ml / LM-Calibration
View on GitHub
☆17May 31, 2023Updated 3 years ago
RyanLiu112 / MRN
View on GitHub
[NeurIPS 2022] Official codebase for "Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learn…
☆26Feb 15, 2025Updated last year
Shentao-YANG / Preference_Grounded_Guidance
View on GitHub
Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).
☆17Jan 8, 2025Updated last year
bcmi220 / esc4nmt
View on GitHub
Explicit Sentence Compression for Neural Machine Translation
☆10May 12, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
luka-group / SuRE
View on GitHub
[EMNLP 2022] Summarization as Indirect Supervision for Relation Extraction (SuRE)
☆27Nov 22, 2022Updated 3 years ago
GanjinZero / GTS
View on GitHub
Code for Unsupervised multi-granular Chinese word segmentation and term discovery via graph partition [JBI]
☆16Jan 28, 2022Updated 4 years ago
google-research-datasets / wikifact
View on GitHub
Wikipedia based dataset to train relationship classifiers and fact extraction models
☆25May 25, 2021Updated 5 years ago
multimodal-art-projection / TreePO
View on GitHub
☆65Mar 30, 2026Updated 3 months ago
EleutherAI / mdl
View on GitHub
Minimum Description Length probing for neural network representations
☆20Jan 28, 2025Updated last year
luosx18 / UED
View on GitHub
Code and data for "An Accurate Unsupervised Method for Joint Entity Alignment and Dangling Entity Detection".
☆15Mar 26, 2022Updated 4 years ago
neelsjain / BYOD
View on GitHub
The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"
☆108Sep 23, 2023Updated 2 years ago
RUCBM / DeepCritic
View on GitHub
Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"
☆41Jun 24, 2025Updated last year
PKU-Alignment / AlignmentSurvey
View on GitHub
AI Alignment: A Comprehensive Survey
☆137Nov 2, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
NuoJohnChen / JudgeLRM
View on GitHub
JudgeLRM: Large Reasoning Models as a Judge
☆42May 6, 2026Updated 2 months ago
Linear95 / DetGP
View on GitHub
Code for the AAAI 2020 oral paper - Dynamic Embedding on Textual Networks via a Gaussian Process.
☆12Mar 26, 2020Updated 6 years ago
lfy79001 / S3Eval
View on GitHub
[NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models
☆33Jun 10, 2024Updated 2 years ago
zhaoyu-li / PyEuclid
View on GitHub
[CAV 2025] PyEuclid: A Versatile Formal Plane Geometry System in Python
☆15Jun 27, 2025Updated last year
qinlibo-hit / CI-ToD
View on GitHub
PyTorch code for EMNLP 2021 paper: Don't be Contradicted with Anything! CI-ToD: Towards Benchmarking Consistency for Task-oriented Dialog…
☆28Oct 4, 2021Updated 4 years ago
zzli2022 / TLDR
View on GitHub
Code for Research Project TLDR
☆26Jul 28, 2025Updated 11 months ago
wellecks / llemma_formal2formal
View on GitHub
Llemma formal2formal (tactic prediction) theorem proving experiments
☆20Oct 17, 2023Updated 2 years ago