sail-sg / feedback-conditional-policy

Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
33 · Updated this week

Alternatives and similar repositories for feedback-conditional-policy

Users who are interested in feedback-conditional-policy are comparing it to the libraries listed below.
