[ACL 2024 main] Aligning Large Language Models with Human Preferences through Representation Engineering (https://aclanthology.org/2024.acl-long.572/)
☆28Sep 25, 2024Updated last year
Alternatives and similar repositories for RAHF
Users that are interested in RAHF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation code for ACL2024:Advancing Parameter Efficiency in Fine-tuning via Representation Editing☆15Apr 20, 2024Updated 2 years ago
- Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective☆35Jan 31, 2025Updated last year
- Experiments with representation engineering☆14Feb 28, 2024Updated 2 years ago
- ☆46Oct 1, 2024Updated last year