flint-xf-fan / Federated-RLHFView on GitHub
[AAMAS 2025] Privacy-preserving and Personalized RLHF, with convergence guarantees. The Code contains experiments for training multiple instances of GPT-2 for personalized sentiment aligned text generation.
16Apr 16, 2025Updated last year

Alternatives and similar repositories for Federated-RLHF

Users that are interested in Federated-RLHF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?