sugarandgugu / Simple-Trl-Training

基于DPO算法微调语言大模型,简单好上手。
35Updated 8 months ago

Alternatives and similar repositories for Simple-Trl-Training:

Users that are interested in Simple-Trl-Training are comparing it to the libraries listed below