IBM / SALMON
Self-Alignment with Principle-Following Reward Models
☆147 · Updated 8 months ago
Related projects
Alternatives and complementary repositories for SALMON
- Official GitHub repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024] ☆127 · Updated 2 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision ☆97 · Updated 2 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users ☆198 · Updated 3 weeks ago
- Simple next-token-prediction for RLHF ☆220 · Updated last year
- PASTA: Post-hoc Attention Steering for LLMs ☆108 · Updated 2 months ago
- ☆158 · Updated last year
- ☆115 · Updated 4 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment" ☆68 · Updated 5 months ago
- Reproduction of "RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment" ☆65 · Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆124 · Updated 3 weeks ago
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following ☆118 · Updated 4 months ago
- ☆89 · Updated 11 months ago
- ☆101 · Updated 5 months ago
- [EMNLP 2023, Findings] GRACE: Discriminator-Guided Chain-of-Thought Reasoning ☆44 · Updated last month
- Reformatted Alignment ☆112 · Updated 2 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆49 · Updated 9 months ago
- ☆90 · Updated 4 months ago
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha… ☆104 · Updated 5 months ago
- Code for ACL2023 paper: Pre-Training to Learn in Context ☆107 · Updated 3 months ago
- ☆259 · Updated last year
- ☆95 · Updated last week
- Scalable Meta-Evaluation of LLMs as Evaluators ☆41 · Updated 9 months ago
- A repository for transformer critique learning and generation ☆86 · Updated 11 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OpenAI ☆83 · Updated last week
- [EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasoning in Language Models with Fine-grained Rewards ☆44 · Updated 6 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment" ☆83 · Updated last week
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning. ☆125 · Updated last year
- Code for the paper <SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning> ☆45 · Updated last year
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" ☆91 · Updated 4 months ago
- ☆112 · Updated last month