lucidrains / llama-qrlhf

Implementation of the Llama architecture with RLHF + Q-learning
157 · Updated 11 months ago

Related projects

Alternatives and complementary repositories for llama-qrlhf