Joyce94 / LLM-RLHF-TuningLinks

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)
420Updated last year

Alternatives and similar repositories for LLM-RLHF-Tuning

Users that are interested in LLM-RLHF-Tuning are comparing it to the libraries listed below

Sorting: