WeiXiongUST / Building-Math-Agents-with-Multi-Turn-Iterative-Preference-Learning

This is an official implementation of the paper "Building Math Agents with Multi-Turn Iterative Preference Learning" with multi-turn DPO and KTO.
20 · Updated last month

Alternatives and similar repositories for Building-Math-Agents-with-Multi-Turn-Iterative-Preference-Learning:

Users who are interested in Building-Math-Agents-with-Multi-Turn-Iterative-Preference-Learning are comparing it to the libraries listed below.