PKU-Alignment / safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
☆1,336Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for safe-rlhf
- MOSS-RLHF☆1,290Updated 8 months ago
- [NIPS2023] RRHF & Wombat☆797Updated last year
- An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)☆2,526Updated this week
- ☆870Updated 3 months ago
- ☆887Updated 5 months ago
- Reference implementation for DPO (Direct Preference Optimization)☆2,156Updated 2 months ago
- A plug-and-play library for parameter-efficient-tuning (Delta Tuning)☆996Updated last month
- Open Academic Research on Improving LLaMA to SOTA LLM☆1,607Updated last year
- Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.☆967Updated 6 months ago
- ☆707Updated 4 months ago
- Aligning Large Language Models with Human: A Survey☆698Updated last year
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆701Updated this week
- We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tunin…☆2,612Updated 10 months ago
- Collaborative Training of Large Language Models in an Efficient Way☆411Updated 2 months ago
- ☆451Updated 5 months ago
- Paper List for In-context Learning 🌷☆815Updated last month
- A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or l…☆275Updated last year
- ⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)☆880Updated 4 months ago
- Efficient Training (including pre-training and fine-tuning) for Big Models☆560Updated 3 months ago
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆979Updated this week
- ☆312Updated 3 months ago
- AgentTuning: Enabling Generalized Agent Abilities for LLMs☆1,362Updated last year
- Best practice for training LLaMA models in Megatron-LM☆627Updated 10 months ago
- 面向中文大模型价值观的评估与对齐研究☆473Updated last year
- 开源SFT数据集整理,随时补充☆440Updated last year
- LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案☆1,162Updated 10 months ago
- Recipes to train reward model for RLHF.☆788Updated this week
- personal chatgpt☆315Updated last week
- ☆272Updated 6 months ago
- Implementation of Chinese ChatGPT☆286Updated 11 months ago