826568389 / GRPO-R1Links
☆13Updated 6 months ago
Alternatives and similar repositories for GRPO-R1
Users that are interested in GRPO-R1 are comparing it to the libraries listed below
Sorting:
- GoGPT中文指令数据集构造☆10Updated last year
- 通用简单工具项目☆20Updated 11 months ago
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10Updated last year
- 天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案☆32Updated last year
- ☆19Updated last year
- 集成Qwen与DeepSeek等先进大语言模型,支持纯LLM+分类层模式及LLM+LoRA+分类层模式,使用transformers模块化设计和训练便于根据需要调整或替换组件。☆14Updated last month
- 大模型智能体Agent中文教程,博客代码仓库☆39Updated last month
- the newest version of llama3,source code explained line by line using Chinese