SparkJiao / dpo-trajectory-reasoning

[EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".
73Updated 2 months ago

Alternatives and similar repositories for dpo-trajectory-reasoning:

Users that are interested in dpo-trajectory-reasoning are comparing it to the libraries listed below