SparkJiao / dpo-trajectory-reasoning

[EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".

Alternatives and similar repositories for dpo-trajectory-reasoning:

Users who are interested in dpo-trajectory-reasoning are comparing it to the libraries listed below.