Joyce94 / LLM-RLHF-Tuning

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)
374Updated last year

Related projects

Alternatives and complementary repositories for LLM-RLHF-Tuning