benpry / why-think-step-by-step
Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"
☆40Updated last year
Related projects ⓘ
Alternatives and complementary repositories for why-think-step-by-step
- Repository for the paper Stream of Search: Learning to Search in Language☆84Updated 3 months ago
- ☆25Updated last month
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆30Updated 9 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆30Updated last month
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆75Updated 3 weeks ago
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL☆43Updated 3 weeks ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆28Updated 8 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆78Updated 8 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆37Updated 5 months ago
- Understanding the correlation between different LLM benchmarks☆29Updated 10 months ago
- ☆28Updated 7 months ago
- A repository for research on medium sized language models.☆74Updated 5 months ago
- ☆42Updated 4 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆79Updated this week
- This is the official repository for all the code of TheoremLlama☆30Updated last month
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆37Updated last year
- ☆50Updated last month
- Evaluating LLMs with CommonGen-Lite☆84Updated 7 months ago
- ☆68Updated 2 months ago
- ☆40Updated last week
- ☆44Updated last month
- ☆26Updated 4 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆46Updated 2 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆102Updated 6 months ago
- ☆31Updated 2 weeks ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆45Updated last month
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆20Updated this week
- ☆49Updated 6 months ago
- DPO, but faster 🚀☆21Updated 2 weeks ago
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆54Updated 4 months ago