lucidrains / llama-qrlhf

Implementation of the Llama architecture with RLHF + Q-learning
156Updated 10 months ago

Related projects

Alternatives and complementary repositories for llama-qrlhf