ljc010717/GRPO2025

ljc010717 / GRPO2025

☆22

Alternatives and similar repositories for GRPO2025

Users who are interested in GRPO2025 are comparing it to the repositories listed below.

AntResearchNLP / AlignXplore
Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals
☆11 · Jan 8, 2026 · Updated 6 months ago
WKCHONG01 / TimeMixer-KAN
☆11 · Oct 24, 2024 · Updated last year
Paul33333 / Agentic_RAG
Local DeepSearch (Advantage: Low Threshold): an implementation of Agentic RAG based on DeepSeek-R1 API and Tavily API
☆17 · Jun 21, 2025 · Updated last year
srsohn / TOD-Flow
TOD-Flow: Modeling the Structure of Task-Oriented Dialogues
☆13 · Feb 7, 2024 · Updated 2 years ago
satori-reasoning / Satori-SWE
☆21 · May 30, 2025 · Updated last year
HarderThenHarder / AgentLife
A small open-source 3D agent simulator based on an LLM.
☆70 · Dec 1, 2024 · Updated last year
anthony-wss / glm-4-voice-finetune
☆14 · Apr 4, 2025 · Updated last year
weitongseu / PCL
☆10 · Jul 11, 2022 · Updated 4 years ago
Zongwei97 / XMSNet
[ACMMM 23] Official implementation of Object Segmentation by Mining Cross-Modal Semantics (First Uniformed model for SOD and/or COD with …
☆18 · Sep 15, 2023 · Updated 2 years ago
ChenglinYu / BHN
☆10 · May 28, 2023 · Updated 3 years ago
JuzhengMiao / CauSSL
☆31 · Oct 1, 2023 · Updated 2 years ago
hbhalpha / MDR
☆25 · May 8, 2025 · Updated last year
Traego / scaled-mcp
ScaledMCP is a horizontally scalable MCP and A2A Server. You know, for AI.
☆43 · Aug 11, 2025 · Updated 11 months ago
happy-xlf / Poem_Knowledge
Knowledge graph of classical Chinese poetry
☆17 · Mar 1, 2022 · Updated 4 years ago
dengwentao99 / SLJA
☆22 · May 22, 2024 · Updated 2 years ago
Ding-ZJ / GLoDe
Improving Pseudo Labels with Global-Local Denoising Framework for Cross-lingual Named Entity Recognition (IJCAI 2024)
☆11 · Aug 18, 2024 · Updated last year
Tizzzzy / Demonstration_Selection_Overview
✨✨ Official repo for "Comparative Analysis of Demonstration Selection Algorithms for LLM In-Context Learning"
☆16 · Nov 8, 2024 · Updated last year
wbopan / Awesome-EToDs-Survey
Collection of papers, benchmarks and newest trends in the domain of End-to-end ToDs
☆14 · Nov 18, 2023 · Updated 2 years ago
X-jun-0130 / KB_LLM
Knowledge bases, large language models, medical knowledge base construction, and LLM-based knowledge bases
☆30 · Jun 13, 2023 · Updated 3 years ago
cvlab-stonybrook / few-shot-scanpath
☆16 · Oct 25, 2025 · Updated 9 months ago
Marstaos / MetaSearch
MetaSearch: an implementation of LLM deep research (DeepSearch) functionality
☆34 · Aug 21, 2025 · Updated 11 months ago
1IsMaple / TriBodyQA-LLM
LLM question answering based on an external knowledge base
☆23 · Mar 6, 2024 · Updated 2 years ago
MoMarky / EGMA
The code of EGMA framework.
☆18 · Jun 14, 2024 · Updated 2 years ago
cjplol / sovits
vocal generation network
☆14 · Mar 21, 2023 · Updated 3 years ago
NUST-Machine-Intelligence-Laboratory / PNP
The source code and models for our paper PNP: Robust Learning from Noisy Labels by Probabilistic Noise Prediction
☆14 · Jan 30, 2023 · Updated 3 years ago
LiaoMengqi / E3-RL4LLMs
[EMNLP 2025 Main] Enhancing Efficiency and Exploration in Reinforcement Learning for LLMs
☆17 · Nov 7, 2025 · Updated 8 months ago
Justherozen / TRAILER
[CVPR 2024] Targeted Representation Alignment for Open-World Semi-Supervised Learning
☆14 · Sep 23, 2024 · Updated last year
uitrbn / TSCSI_IDN
☆14 · Mar 1, 2023 · Updated 3 years ago
ZJUJeffLai / SAW_SSL
☆14 · Oct 31, 2022 · Updated 3 years ago
Dylan9897 / LLM-TextClassification
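Integrates advanced large language models such as Qwen and DeepSeek, supporting both a pure LLM + classification layer mode and an LLM + LoRA + classification layer mode; uses transformers for modular design and training, making it easy to adjust or replace components as needed.
☆21 · Sep 1, 2025 · Updated 10 months ago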
AIIRWKV / RWKV-RAG-Personal
RWKV-RAG Personal Edition
☆28 · Aug 6, 2025 · Updated 11 months ago
aurora1625 / dmuthesis-latex
LaTeX template for Dalian Maritime University Graduation Thesis
☆20 · May 27, 2013 · Updated 13 years ago
Trae1ounG / 2024-BaiduAI-LLM-DSI
National first-prize solution for Track 1 of the 2024 Baidu Commercial AI Technology Innovation Competition: LLM-based advertisement retrieval
☆19 · Feb 23, 2025 · Updated last year
wzhwzhwzh0921 / Awesome_LRM_with_Entropy
Introduction about AWESOME_ENTROPY+LRM_PAPERS
☆32 · Dec 16, 2025 · Updated 7 months ago
GasolSun36 / MVP
Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning
☆24 · Sep 9, 2024 · Updated last year
VITA-Group / Nabla-Reasoner
[ICLR'26] "Nabla-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space" by Peihao Wang*, Ruisi Cai*, Zhen Wang, Hongyuan…
☆35 · Mar 10, 2026 · Updated 4 months ago
Raki-j / nlp-beginner-Raki
nlp-beginner: introductory exercises from the Fudan University NLP lab
☆27 · Jan 22, 2022 · Updated 4 years ago
PhysiLearn / SDS-Net
[IEEE Trans. TGRS 2025] Shallow-Deep Synergism-detection Network for infrared small target detection
☆23 · Oct 30, 2025 · Updated 8 months ago
XueruiSu / Reproduce-DeepSeek-R1-Survey
This repository collects various works that reproduce DeepSeek R1, as well as works related to DeepSeek R1 and the DeepSeek series.
☆19 · Apr 27, 2025 · Updated last year