lemon-prog123 / LongRePS
Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision
☆13Updated last month
Alternatives and similar repositories for LongRePS
Users that are interested in LongRePS are comparing it to the libraries listed below
Sorting:
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆38Updated last year
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.