LiuAmber / RAHF
[ACL 2024 main] Aligning Large Language Models with Human Preferences through Representation Engineering (https://aclanthology.org/2024.acl-long.572/)
28 | Sep 25, 2024 | Updated last year

Alternatives and similar repositories for RAHF

Users who are interested in RAHF are comparing it to the libraries listed below.
