lucidrains / llama-qrlhfView on GitHub
Implementation of the Llama architecture with RLHF + Q-learning
β˜†170Feb 1, 2025Updated last year

Alternatives and similar repositories for llama-qrlhf

Users that are interested in llama-qrlhf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?