GaryYufei/AlignLLMHumanSurvey

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/GaryYufei/AlignLLMHumanSurvey)

GaryYufei / AlignLLMHumanSurvey

Aligning Large Language Models with Human: A Survey

☆742

Alternatives and similar repositories for AlignLLMHumanSurvey

Users that are interested in AlignLLMHumanSurvey are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HillZhang1999 / llm-hallucination-survey
View on GitHub
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large …
☆1,085Sep 27, 2025Updated 9 months ago
opendilab / awesome-RLHF
View on GitHub
A curated list of reinforcement learning with human feedback resources (continually updated)
☆4,416May 20, 2026Updated 2 months ago
MLGroupJLU / LLM-eval-survey
View on GitHub
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
☆1,610Apr 17, 2026Updated 3 months ago
PKU-Alignment / safe-rlhf
View on GitHub
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
☆1,610Nov 24, 2025Updated 8 months ago
zjunlp / Prompt4ReasoningPapers
View on GitHub
[ACL 2023] Reasoning with Language Model Prompting: A Survey
☆1,009May 21, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
GanjinZero / RRHF
View on GitHub
[NIPS2023] RRHF & Wombat
☆805Sep 22, 2023Updated 2 years ago
OpenLMLab / MOSS-RLHF
View on GitHub
Secrets of RLHF in Large Language Models Part I: PPO
☆1,426Mar 3, 2024Updated 2 years ago
SinclairCoder / Instruction-Tuning-Papers
View on GitHub
Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).
☆769Jul 20, 2023Updated 3 years ago
OpenBMB / UltraFeedback
View on GitHub
A large-scale, fine-grained, diverse preference dataset (and models).
☆368Dec 29, 2023Updated 2 years ago
icip-cas / awesome-auto-alignment
View on GitHub
Collection of papers for scalable automated alignment.
☆92Oct 22, 2024Updated last year
huggingface / trl
View on GitHub
Train transformer language models with reinforcement learning.
☆18,934Updated this week
FranxYao / chain-of-thought-hub
View on GitHub
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
☆2,776Aug 4, 2024Updated last year
OpenRLHF / OpenRLHF
View on GitHub
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…
☆9,853Jul 14, 2026Updated last week
teacherpeterpan / self-correction-llm-papers
View on GitHub
This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.
☆573Oct 28, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
yizhongw / self-instruct
View on GitHub
Aligning pretrained language models with instruction data generated by themselves.
☆4,606Mar 27, 2023Updated 3 years ago
huggingface / alignment-handbook
View on GitHub
Robust recipes to align language models with human and AI preferences
☆5,645May 26, 2026Updated 2 months ago
hkust-nlp / deita
View on GitHub
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
☆600Dec 9, 2024Updated last year
anthropics / hh-rlhf
View on GitHub
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
☆1,853Jun 17, 2025Updated last year
IBM / Dromedary
View on GitHub
Dromedary: towards helpful, ethical and reliable LLMs.
☆1,138Sep 18, 2025Updated 10 months ago
jeffhj / LM-reasoning
View on GitHub
This repository contains a collection of papers and resources on Reasoning in Large Language Models.
☆572Nov 13, 2023Updated 2 years ago
Timothyxxx / Chain-of-ThoughtsPapers
View on GitHub
A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
☆2,105Oct 5, 2023Updated 2 years ago
Paitesanshi / LLM-Agent-Survey
View on GitHub
☆2,909Feb 20, 2025Updated last year
RUCAIBox / LLMSurvey
View on GitHub
The official GitHub page for the survey paper "A Survey of Large Language Models".
☆12,194Mar 11, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
WooooDyy / LLM-Agent-Paper-List
View on GitHub
The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et a…
☆8,169Sep 12, 2025Updated 10 months ago
atfortes / Awesome-LLM-Reasoning
View on GitHub
From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓
☆3,654Apr 20, 2026Updated 3 months ago
tatsu-lab / alpaca_eval
View on GitHub
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
☆2,007Aug 9, 2025Updated 11 months ago
hyp1231 / awesome-llm-powered-agent
View on GitHub
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
☆2,251Apr 30, 2025Updated last year
dqxiu / ICL_PaperList
View on GitHub
Paper List for In-context Learning 🌷
☆876Oct 8, 2024Updated last year
THU-KEG / EvaluationPapers4ChatGPT
View on GitHub
Resource, Evaluation and Detection Papers for ChatGPT
☆456Mar 21, 2024Updated 2 years ago
NVIDIA / NeMo-Aligner
View on GitHub
Scalable toolkit for efficient model alignment
☆851Oct 6, 2025Updated 9 months ago
RLHFlow / RLHF-Reward-Modeling
View on GitHub
Recipes to train reward model for RLHF.
☆1,534Apr 24, 2025Updated last year
allenai / RL4LMs
View on GitHub
A modular RL library to fine-tune language models to human preferences
☆2,393Mar 1, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
GAIR-NLP / O1-Journey
View on GitHub
O1 Replication Journey
☆2,001Jan 14, 2025Updated last year
eric-mitchell / direct-preference-optimization
View on GitHub
Reference implementation for DPO (Direct Preference Optimization)
☆2,899Aug 11, 2024Updated last year
thunlp / PromptPapers
View on GitHub
Must-read papers on prompt-based tuning for pre-trained language models.
☆4,323Jul 17, 2023Updated 3 years ago
Instruction-Tuning-with-GPT-4 / GPT-4-LLM
View on GitHub
Instruction Tuning with GPT-4
☆4,332Jun 11, 2023Updated 3 years ago
thu-coai / Safety-Prompts
View on GitHub
Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts，用于评估和提升大模型的安全性。
☆1,190Feb 27, 2024Updated 2 years ago
Magnetic2014 / llm-alignment-survey
View on GitHub
A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…
☆82Sep 28, 2023Updated 2 years ago
thunlp / UltraChat
View on GitHub
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
☆2,875Mar 13, 2024Updated 2 years ago