daveshap / RLHI

Reinforcement Learning with Heuristic Imperatives - Finetuning LLMs for Post-Conventional Moral Intuition
☆64 Updated last year

Alternatives and similar repositories for RLHI:

Users who are interested in RLHI are comparing it to the libraries listed below.