agi-templar / Stable-Alignment

Multi-agent social simulation plus an efficient, effective, and stable alternative to RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".
346 · Updated last year

Alternatives and similar repositories for Stable-Alignment

Users who are interested in Stable-Alignment compare it to the libraries listed below.
