☆42 Mar 6, 2025 Updated 11 months ago
Alternatives and similar repositories for grpo-loss
Users that are interested in grpo-loss are comparing it to the libraries listed below
- Simplest online-softmax notebook for explaining Flash Attention ☆16 Jan 27, 2026 Updated last month
- Chinese version of hf-alignment-handbook: a complete set of LLM training tutorials covering SFT, DPO, ORPO, and CPT. ☆14 Aug 25, 2024 Updated last year
- ☆11 Feb 15, 2026 Updated last week
- Train your own Chinese embedding model ☆28 Jan 6, 2025 Updated last year
- differentiable top-k operator ☆22 Dec 30, 2024 Updated last year
- General-purpose simple utilities project ☆22 Oct 6, 2024 Updated last year
- A simple WeChat Official Account layout tool based on Dify ☆17 Jun 27, 2025 Updated 8 months ago
- ☆22 Feb 14, 2026 Updated last week
- A Complete Introduction to Building Generative AI Apps with Dify ☆17 May 25, 2025 Updated 9 months ago
- PPO algorithm implementation ☆39 Jun 5, 2024 Updated last year
- First-place (top-1) solution for the Tianchi algorithm competition "BetterMixture - LLM Data Mixing Challenge" ☆34 Jul 7, 2024 Updated last year
- Write database metadata into the Dify knowledge base ☆12 Dec 30, 2025 Updated 2 months ago
- KsanaDiT: High-Performance DiT (Diffusion Transformer) Inference Framework for Video & Image Generation ☆36 Feb 6, 2026 Updated 3 weeks ago
- ☆28 Dec 4, 2025 Updated 2 months ago
- ☆11 Aug 29, 2025 Updated 5 months ago
- Workflow automation, but you just describe what you want and it happens. ☆27 Nov 22, 2025 Updated 3 months ago
- Minimalistic TensorFlow2+ deep metric/similarity learning library with loss functions, miners, and utils as embedding projector. ☆38 Jan 28, 2023 Updated 3 years ago
- A reproduction of open-r1 that runs GRPO training on 0.5B, 1.5B, 3B, and 7B Qwen models and observes some interesting phenomena. ☆56 Apr 13, 2025 Updated 10 months ago
- Group-Group Loss Based Global-Regional Feature Learning for Vehicle Re-Identification