826568389/GRPO-R1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/826568389/GRPO-R1)

826568389 / GRPO-R1

☆13

Alternatives and similar repositories for GRPO-R1

Users that are interested in GRPO-R1 are comparing it to the libraries listed below

Sorting:

erichoangnle / chinese_chess
View on GitHub
2 players chinese chess game with PyGame
☆14Oct 25, 2022Updated 3 years ago
pingcy / mcp-deepresearch
View on GitHub
MCP DeepResearch Server: 基于 LangGraph + Ollama + Tavily 的深度研究服务器，支持异步运行、超时控制与进度推送
☆31Jun 16, 2025Updated 9 months ago
AISG-Technology-Team / GCSS-Track-1A-Submission-Guide
View on GitHub
Submission Guide + Discussion Board for AI Singapore Global Challenge for Safe and Secure LLMs (Track 1A).
☆16Jul 4, 2024Updated last year
Huangjian2013 / ai-demo
View on GitHub
AI Demo 项目，一个专门为希望学习和探索人工智能（AI）技术的开发者准备的实战案例集合。
☆25Jan 3, 2026Updated 2 months ago
WangRongsheng / LLM101
View on GitHub
This repo offers advanced tutorials for LLMs, BERT-based models, and multimodal models, covering fine-tuning, quantization, vocabulary ex…
☆24May 5, 2025Updated 10 months ago
habout632 / llms
View on GitHub
llms related stuff , including code, docs
☆13Feb 25, 2025Updated last year
heyblackC / BetterMixture-Top1-Solution
View on GitHub
天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案
☆34Jul 7, 2024Updated last year
CrazyBoyM / LLM-Chinese
View on GitHub
（撰写ing..)本仓库偏教程性质，以「模型中文化」为一个典型的模型训练问题切入场景，指导读者上手学习LLM二次微调训练。
☆37Aug 5, 2024Updated last year
JuniorPan / 2018_interview
View on GitHub
放弃幻想、时刻准备、随时面试
☆14Dec 17, 2025Updated 3 months ago
chaizheng2157 / RGBD_ORB_SLAM2_RT
View on GitHub
Real-time envrionment reconstruction based on ORB_SLAM2 with XTION (RGBD sensor)
☆35May 28, 2016Updated 9 years ago
2404589803 / hf_downloader
View on GitHub
🤗 HF Downloader (Hugging Face Downloader) 📦 A user-friendly GUI tool for downloading Hugging Face resources with enhanced connectivity…
☆13Jan 5, 2025Updated last year
aiprodcoder / MixAPI-PRO
View on GitHub
大模型API企业网关，公司内部API管理，分发聚和系统，支持将多种大模型转换成统一的OpenAI兼容接口，尤其对国内开源模型deepseek,qwen，kimi，glm提供特别支持可供个人或者企业内部大模型API统一管理和渠道分发使用(key管理与二次分发)，长期更新，支…
☆40Sep 12, 2025Updated 6 months ago
JOHNNY-fans / RankNorm
View on GitHub
☆13Feb 21, 2025Updated last year
VITA-Group / Robust_Weight_Signatures
View on GitHub
[ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang
☆16May 4, 2023Updated 2 years ago
zzp1012 / Cross-Task-Linearity
View on GitHub
[ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"
☆11Feb 20, 2025Updated last year
zejunwang1 / bloom_tuning
View on GitHub
BLOOM 模型的指令微调
☆24Jun 15, 2023Updated 2 years ago
Duende510 / iOS-ImGui-ModMenu-Jailed
View on GitHub
Mod Menu for non-jailbroken devices
☆26Nov 12, 2024Updated last year
shibing624 / nerpy
View on GitHub
🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具，支持BertSoftmax、BertSpan等模型，开箱即用。
☆118Feb 19, 2024Updated 2 years ago
STAIR-BUPT / STAIR-LLMGuardrails
View on GitHub
☆12Sep 29, 2024Updated last year
hyongtao-code / chatDB-dify
View on GitHub
It is a simple demo of chatDB workflow in dify.
☆24Dec 7, 2024Updated last year
lancopku / MUKI
View on GitHub
[Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models
☆19Mar 16, 2023Updated 3 years ago
anastasiosyal / phi4-multimodal-instruct-server
View on GitHub
Phi4 Multimodal Instruct - OpenAI endpoint and Docker Image for self-hosting
☆40Mar 2, 2025Updated last year
Paulzhang2023 / Dify-DSL-collection
View on GitHub
Dify DSL collection收集Dify工作流文件DSL，这里很多文件并不是本人原创，而是收集而来，感谢原作者。目前我是初学github，后面会加入大量原创内容
☆26Jul 13, 2025Updated 8 months ago
nwinter / ultimate-jailbreaking-championship
View on GitHub
Playing around with various jailbreaking techniques ahead of the Gray Swan AI Ultimate Jailbreaking Competition
☆18Oct 6, 2024Updated last year
prestodb / presto-query-predictor
View on GitHub
A query predictor pipeline and service to predict resource usages of Presto queries
☆15May 2, 2023Updated 2 years ago
rkuo2000 / GenAI
View on GitHub
☆11Updated this week
listen0425 / Safety-Layers
View on GitHub
code space of paper "Safety Layers in Aligned Large Language Models: The Key to LLM Security" (ICLR 2025)
☆22Apr 26, 2025Updated 10 months ago
SLIT-AI / ADPA
View on GitHub
[ICLR2025 Spotlight] Advantage-Guided Distillation for Preference Alignment in Small Language Models
☆24Feb 10, 2025Updated last year
tntlinking-opensource / openhis-itai
View on GitHub
OpenHIS医院系统（信创版）集十大核心模块于一体，涵盖目录管理、基础数据配置、个性化设置、门诊/住院全流程管理、药房药库智能管控、精细化耗材管理、财务核算体系、医保合规对接及多维报表分析等功能模块，共计372项标准化功能。
☆15Feb 5, 2026Updated last month
StibiumT16 / Robust-Fine-tuning
View on GitHub
Code for Robust Fine-tuning (RbFT)
☆17Jan 31, 2025Updated last year
linancn / TianGong-AI-Unstructure
View on GitHub
TianGong-AI-Unstructure
☆71Feb 4, 2026Updated last month
OODRobustBench / OODRobustBench
View on GitHub
OODRobustBench: a Benchmark and Large-Scale Analysis of Adversarial Robustness under Distribution Shift. ICML 2024 and ICLRW-DMLR 2024
☆23Jul 25, 2024Updated last year
liangdabiao / deep_search_write
View on GitHub
AI写作小工具方案：让2个智能体合作写出真正可用的图文并茂的帖子（微信公众号，小红书，博客）。1，写作智能体，2，知识库智能体。
☆21Jun 8, 2025Updated 9 months ago
yanqiangmiffy / KDD2024-WhoIsWho-Top3
View on GitHub
KDD2024-WhoIsWho-Top3
☆16Jun 17, 2024Updated last year
GaoxiangLuo / LLM-BioMed-NER-RE
View on GitHub
[npj Digital Medicine] An In-Depth Evaluation of Federated Learning on Biomedical Natural Language Processing for Information Extraction
☆12May 1, 2024Updated last year
taishan1994 / BERT-ABSA
View on GitHub
使用bert进行中文方面级情感识别。
☆25Jun 26, 2023Updated 2 years ago
BenderScript / PromptInjectionBench
View on GitHub
Prompt Injection Attacks against GPT-4, Gemini, Azure, Azure with Jailbreak
☆29Oct 8, 2024Updated last year
WeipingFu / QGEval
View on GitHub
QGEval: A Benchmark for Question Generation Evaluation
☆19Nov 7, 2024Updated last year
lxfight / astrbot_plugin_deepresearch
View on GitHub
为 AstrBot 提供一种 Deepresearch 方案
☆29Aug 5, 2025Updated 7 months ago