Oxen-AI / GRPO-With-Cargo-Feedback

This repository contains code for fine-tuning LLMs with GRPO for Rust programming, using cargo as feedback.
80 · Updated last month

Alternatives and similar repositories for GRPO-With-Cargo-Feedback:

Users who are interested in GRPO-With-Cargo-Feedback are comparing it to the libraries listed below.