andersonbcdefg / dpo-lora

direct preference optimization with only 1 model copy :)
12 · Updated last year
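
For readers unfamiliar with the objective named in the description, here is a minimal sketch of the standard DPO loss for a single preference pair. It is not taken from the repository. The function name `dpo_loss`, the `beta` default, and the example log-probabilities are all illustrative assumptions. The "1 model copy" idea is assumed here to mean that the reference log-probabilities come from the same base weights with the LoRA adapters switched off, rather than from a second frozen model; the listing itself does not confirm how the repository does this.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) pair.

    Each argument is a summed sequence log-probability. In a LoRA setup
    (an assumption, not confirmed by the repo), the "policy" values would
    come from the model with adapters enabled and the "ref" values from
    the same model with adapters disabled.
    """
    policy_margin = policy_chosen_logp - policy_rejected_logp
    ref_margin = ref_chosen_logp - ref_rejected_logp
    logits = beta * (policy_margin - ref_margin)
    # -log(sigmoid(x)) == log(1 + exp(-x)), evaluated in a stable form.
    if logits >= 0:
        return math.log1p(math.exp(-logits))
    return -logits + math.log1p(math.exp(logits))


# Hypothetical log-probabilities, chosen only for illustration.
neutral = dpo_loss(-2.0, -2.0, -2.0, -2.0)    # policy matches reference
improved = dpo_loss(-1.0, -3.0, -2.0, -2.0)   # policy favors the chosen reply
```

When the policy and reference agree, the loss equals log 2; it falls below that as the policy raises the chosen response relative to the rejected one more than the reference does.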

Related projects

Alternatives and complementary repositories for dpo-lora