jasonvanf / llama-trl

LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA
β˜†185Updated last year

Related projects β“˜

Alternatives and complementary repositories for llama-trl