Oxen-AI / GRPO-With-Cargo-Feedback

This repository contains code for fine-tuning LLMs with GRPO for Rust programming, using cargo as feedback.
86 · Updated 2 months ago

Alternatives and similar repositories for GRPO-With-Cargo-Feedback

Users who are interested in GRPO-With-Cargo-Feedback compare it to the libraries listed below.
