SparkJiao / dpo-trajectory-reasoningLinks

[EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".
78Updated 4 months ago

Alternatives and similar repositories for dpo-trajectory-reasoning

Users that are interested in dpo-trajectory-reasoning are comparing it to the libraries listed below

Sorting: