WeiXiongUST / Building-Math-Agents-with-Multi-Turn-Iterative-Preference-LearningView on GitHub
This is an official implementation of the paper ``Building Math Agents with Multi-Turn Iterative Preference Learning'' with multi-turn DPO and KTO.
32Dec 5, 2024Updated last year

Alternatives and similar repositories for Building-Math-Agents-with-Multi-Turn-Iterative-Preference-Learning

Users that are interested in Building-Math-Agents-with-Multi-Turn-Iterative-Preference-Learning are comparing it to the libraries listed below

Sorting:

Are these results useful?