SparkJiao / dpo-trajectory-reasoning

[EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".
83 · Jan 14, 2025 · Updated last year

Alternatives and similar repositories for dpo-trajectory-reasoning

Users who are interested in dpo-trajectory-reasoning are comparing it to the libraries listed below.
