THUDM / ReST-MCTS

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
292Updated 3 weeks ago

Related projects

Alternatives and complementary repositories for ReST-MCTS