XanderJC / attention-based-credit

Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt, and Mihaela van der Schaar
22 · Updated 5 months ago

Alternatives and similar repositories for attention-based-credit:

Users who are interested in attention-based-credit are comparing it to the libraries listed below.