Linear95 / SPAG
Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024
☆122Updated 3 weeks ago
Alternatives and similar repositories for SPAG:
Users that are interested in SPAG are comparing it to the libraries listed below
- ☆95Updated 8 months ago
- ☆135Updated 3 months ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.☆221Updated last week
- Self-Alignment with Principle-Following Reward Models☆156Updated last year
- Repo of paper "Free Process Rewards without Process Labels"☆136Updated this week
- ☆156Updated last week
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆130Updated 6 months ago
- Code and example data for the paper: Rule Based Rewards for Language Model Safety☆183Updated 8 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆118Updated 6 months ago
- ☆102Updated last month
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆149Updated 11 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆142Updated 4 months ago
- Code for ACL2024 paper - Adversarial Preference Optimization (APO).☆52Updated 9 months ago
- ☆143Updated 3 months ago
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆163Updated 2 weeks ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆72Updated 9 months ago
- A simple unified framework for evaluating LLMs☆204Updated last week
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging☆97Updated last year
- official implementation of paper "Process Reward Model with Q-value Rankings"☆51Updated last month
- Replicating O1 inference-time scaling laws☆83Updated 3 months ago
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆52Updated 9 months ago
- ☆111Updated 3 weeks ago
- Reformatted Alignment☆114Updated 5 months ago
- RLHF implementation details of OAI's 2019 codebase☆183Updated last year
- The official implementation of Self-Exploring Language Models (SELM)☆62Updated 9 months ago