icip-cas / awesome-auto-alignment
Collection of papers for scalable automated alignment.
☆93 · Oct 22, 2024 · Updated last year
Alternatives and similar repositories for awesome-auto-alignment
Users interested in awesome-auto-alignment are comparing it to the libraries listed below.
- Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey" ☆59 · Aug 24, 2023 · Updated 2 years ago
- Official Code Repository for [AutoScale: Scale-Aware Data Mixing for Pre-Training LLMs], published as a conference paper at COLM 2025… ☆13 · Aug 8, 2025 · Updated 6 months ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI ☆107 · Mar 6, 2025 · Updated 11 months ago
- Self-Supervised Alignment with Mutual Information ☆20 · May 24, 2024 · Updated last year
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency. ☆12 · Oct 12, 2024 · Updated last year
- Official repository for ALT (ALignment with Textual feedback). ☆10 · Jul 25, 2024 · Updated last year
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering ☆63 · Dec 5, 2024 · Updated last year
- ☆23 · Jul 5, 2024 · Updated last year
- Reproducing R1 for Code with Reliable Rewards ☆12 · Apr 9, 2025 · Updated 10 months ago
- Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models" ☆17 · Mar 29, 2024 · Updated last year
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025) ☆73 · Jun 25, 2024 · Updated last year
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?" (AAAI 2024) ☆50 · Apr 19, 2024 · Updated last year
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback ☆74 · Aug 31, 2024 · Updated last year
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track) ☆101 · Feb 20, 2025 · Updated 11 months ago
- Implementation of Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation. Paper: https://arxiv.org/abs/2404.06809 ☆22 · Oct 22, 2024 · Updated last year
- [EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasonin… ☆51 · May 4, 2024 · Updated last year
- Code for paper: Long cOntext aliGnment via efficient preference Optimization ☆24 · Oct 10, 2025 · Updated 4 months ago
- ☆322 · Jul 25, 2024 · Updated last year
- A multi-dimensional Chinese alignment evaluation benchmark for large language models (ACL 2024) ☆421 · Oct 25, 2025 · Updated 3 months ago
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions ☆177 · Jun 1, 2023 · Updated 2 years ago
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza… ☆20 · Nov 21, 2024 · Updated last year
- Small, simple agent task environments for training and evaluation ☆19 · Nov 1, 2024 · Updated last year
- Code for ACL 2024 paper - Adversarial Preference Optimization (APO). ☆56 · Jun 3, 2024 · Updated last year
- ☆62 · Oct 29, 2024 · Updated last year
- ☆27 · Oct 9, 2025 · Updated 4 months ago
- Aligning Large Language Models with Human: A Survey ☆742 · Sep 11, 2023 · Updated 2 years ago
- Scalable toolkit for efficient model alignment ☆852 · Oct 6, 2025 · Updated 4 months ago
- The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey" ☆189 · Oct 28, 2024 · Updated last year
- [2025-TMLR] A Survey on the Honesty of Large Language Models ☆64 · Dec 8, 2024 · Updated last year
- Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models) ☆199 · Dec 16, 2023 · Updated 2 years ago
- ☆321 · Sep 18, 2024 · Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 · Mar 12, 2024 · Updated last year
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR 2024] ☆588 · Dec 9, 2024 · Updated last year
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following ☆136 · Jul 8, 2024 · Updated last year
- [ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling" ☆25 · Mar 28, 2024 · Updated last year
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks ☆49 · Sep 4, 2025 · Updated 5 months ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning ☆30 · Mar 5, 2024 · Updated last year
- Repository for the tw.org site ☆14 · Jan 8, 2026 · Updated last month
- The code and data for the paper "JiuZhang3.0" ☆49 · May 26, 2024 · Updated last year