JHU-CLSP / RATIONALYST
Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044
☆30Updated last month
Related projects ⓘ
Alternatives and complementary repositories for RATIONALYST
- ☆25Updated last month
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆44Updated 9 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆45Updated last month
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆28Updated 8 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆78Updated 8 months ago
- ☆40Updated this week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆46Updated 2 months ago
- This is the official repository for all the code of TheoremLlama☆30Updated last month
- ☆61Updated 2 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆79Updated this week
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆25Updated last year
- Data preparation code for CrystalCoder 7B LLM☆42Updated 6 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆84Updated 3 months ago
- ☆42Updated 4 months ago
- ☆35Updated 2 weeks ago
- ☆49Updated 6 months ago
- Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs☆42Updated 4 months ago
- This is the official repository for Inheritune.