ALucek / GRPO-Training
An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning
☆31 Updated last month
Alternatives and similar repositories for GRPO-Training:
Users who are interested in GRPO-Training are comparing it to the libraries listed below.
- Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati… ☆39 Updated 3 weeks ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT) ☆107 Updated last month
- ☆87 Updated this week
- An LLM reads a paper and produces a working prototype ☆51 Updated 2 weeks ago
- ☆29 Updated last year