ai-in-pm / rStar-Math
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
☆21Updated last week
Alternatives and similar repositories for rStar-Math:
Users that are interested in rStar-Math are comparing it to the libraries listed below
- ☆40Updated last month
- This the implementation of LeCo☆30Updated this week
- ☆49Updated 4 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆50Updated 7 months ago
- Reformatted Alignment☆113Updated 3 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆57Updated this week
- ☆27Updated last month
- ☆23Updated 4 months ago
- ☆53Updated 3 months ago
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆39Updated 3 weeks ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Updated 10 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆77Updated 3 months ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆102Updated last month
- ☆48Updated 10 months ago
- ☆88Updated last month
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆25Updated 10 months ago
- [EMNLP 2024] Ask-before-Plan: Proactive Language Agents for Real-World Planning☆17Updated 2 months ago
- ☆82Updated last week
- The Official Code Repository for GUI-World.☆44Updated last month
- ☆98Updated last month
- ☆58Updated 4 months ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated 10 months ago
- The GitHub repository for the paper "Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning" accepte…☆18Updated 10 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆64Updated last month
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆40Updated 6 months ago
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)☆48Updated last year
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆42Updated 11 months ago
- This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI"☆90Updated last month
- ☆13Updated 10 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆44Updated 3 weeks ago