☆98May 30, 2023Updated 2 years ago
Alternatives and similar repositories for reward-modeling
Users that are interested in reward-modeling are comparing it to the libraries listed below
Sorting:
- A repository for transformer critique learning and generation☆89Dec 7, 2023Updated 2 years ago
- A (somewhat) minimal library for finetuning language models with PPO on human feedback.☆90Nov 23, 2022Updated 3 years ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,738Jan 8, 2024Updated 2 years ago
- ☆35Jan 29, 2023Updated 3 years ago
- A library for squeakily cleaning and filtering language datasets.☆50Jul 10, 2023Updated 2 years ago
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"☆10May 5, 2024Updated last year
- Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-…☆564May 9, 2024Updated last year
- ☆158Mar 18, 2023Updated 2 years ago
- A modular RL library to fine-tune language models to human preferences☆2,380Mar 1, 2024Updated 2 years ago
- ☆12Jan 17, 2025Updated last year
- ☆33Apr 23, 2023Updated 2 years ago
- Training AI for Super Smash Bros. Melee☆32Mar 27, 2025Updated 11 months ago
- Pretraining summarization models using a corpus of nonsense☆13Sep 28, 2021Updated 4 years ago
- ☆14May 8, 2023Updated 2 years ago
- Code for the paper Fine-Tuning Language Models from Human Preferences☆1,380Jul 25, 2023Updated 2 years ago
- Code accompanying the paper Pretraining Language Models with Human Preferences☆180Feb 13, 2024Updated 2 years ago
- One stop shop for all things carp☆59Sep 9, 2022Updated 3 years ago
- ☆14Aug 15, 2024Updated last year
- AdamW optimizer for bfloat16 models in pytorch 🔥.☆39Jun 16, 2024Updated last year
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"☆1,824Jun 17, 2025Updated 8 months ago
- Hidden Engrams: Long Term Memory for Transformer Model Inference☆35Jun 26, 2021Updated 4 years ago
- ☆18Feb 20, 2024Updated 2 years ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆39Jan 12, 2024Updated 2 years ago
- Redwood Research's transformer interpretability tools☆15Apr 15, 2022Updated 3 years ago
- A Data Source for Reasoning Embodied Agents☆19Sep 18, 2023Updated 2 years ago
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"☆211Jul 31, 2023Updated 2 years ago
- The code for the video tutorial series on building a Transformer from scratch: https://www.youtube.com/watch?v=XR4VDnJzB8o☆19Apr 15, 2023Updated 2 years ago
- Implementation of MixCE method described in ACL 2023 paper by Zhang et al.☆20May 29, 2023Updated 2 years ago
- Self-Alignment with Principle-Following Reward Models☆170Sep 18, 2025Updated 5 months ago
- Implementation of Reinforcement Learning from Human Feedback (RLHF)☆174Apr 7, 2023Updated 2 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆45Oct 1, 2025Updated 5 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20May 31, 2023Updated 2 years ago
- Expanding natural instructions☆1,035Dec 11, 2023Updated 2 years ago
- Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling'☆27May 16, 2025Updated 9 months ago
- Momentum Decoding: Open-ended Text Generation as Graph Exploration☆19Jan 27, 2023Updated 3 years ago
- ☆25May 28, 2025Updated 9 months ago
- A Benchmark Dataset for Multimodal Scientific Fact Checking☆26Sep 17, 2024Updated last year
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆31Apr 1, 2025Updated 11 months ago