ssbuild / llm_rlhf

Reinforcement learning training (RLHF) for large language models such as GPT-2, LLaMA, and BLOOM.
26 · Updated last year

Related projects

Alternatives and complementary repositories for llm_rlhf