swtheing / PF-PPO-RLHF
☆30Updated 7 months ago
Alternatives and similar repositories for PF-PPO-RLHF:
Users that are interested in PF-PPO-RLHF are comparing it to the libraries listed below
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"