THUDM / ReST-MCTS

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
316Updated last month

Related projects

Alternatives and complementary repositories for ReST-MCTS