EsYoon7 / RLHF-TLCR

[ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"
12 · Dec 6, 2024 · Updated last year

Alternatives and similar repositories for RLHF-TLCR

Users interested in RLHF-TLCR are comparing it to the libraries listed below.
