facebookresearch / RLCDLinks
Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment
☆69Updated last year
Alternatives and similar repositories for RLCD
Users that are interested in RLCD are comparing it to the libraries listed below
Sorting: