facebookresearch / RLCDView on GitHub
Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment
69Aug 18, 2023Updated 2 years ago

Alternatives and similar repositories for RLCD

Users that are interested in RLCD are comparing it to the libraries listed below

Sorting:

Are these results useful?