kxfan2002/Reagent

kxfan2002 / Reagent

Agent-RRM: Exploring Reasoning Reward Model for Agents

☆70

Alternatives and similar repositories for Reagent

Users who are interested in Reagent are comparing it to the repositories listed below.

appletea233 / EditThinker
Unlocking Iterative Reasoning for Any Image Editor
☆111 · Jan 18, 2026 · Updated 6 months ago
THUDM / CaRR
This repository contains the code and data for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents wit…
☆72 · Apr 8, 2026 · Updated 3 months ago
HKU-MMLab / Macro
The official repo of "MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data"
☆66 · Mar 27, 2026 · Updated 3 months ago
LAW1223 / OpenSubject
☆55 · Dec 10, 2025 · Updated 7 months ago
InternScience / MME-Reasoning
Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMs
☆45 · Jun 17, 2025 · Updated last year
Hesse73 / RLVR-Directions
Source Code for our ICLR'26 paper
☆17 · Feb 22, 2026 · Updated 5 months ago
shawn0728 / Unify-Agent
🐧 Unify-Agent: An end-to-end unified multimodal agent for faithful, knowledge-grounded image generation.
☆86 · May 2, 2026 · Updated 2 months ago
tulerfeng / OneThinker
🔥 OneThinker: All-in-one Reasoning Model for Image and Video [CVPR 2026]
☆463 · Feb 28, 2026 · Updated 4 months ago
RUC-NLPIR / ARPO
[ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)
☆1,090 · Jul 13, 2026 · Updated last week
LuckyyySTA / GOLF
☆18 · Mar 16, 2026 · Updated 4 months ago
Trae1ounG / Pretrain_Space_RLVR
[arxiv: 2604.14142] From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space
☆17 · Apr 16, 2026 · Updated 3 months ago
zhangxy-2019 / critique-GRPO
[ICML 2026 Spotlight] Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback
☆70 · Jun 3, 2026 · Updated last month
dengmengjie / ToolScope
Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use
☆31 · Nov 4, 2025 · Updated 8 months ago
shawn0728 / OpenSearch-VL
🔍 OpenSearch-VL provides a fully open recipe for training strong multimodal deep search agents through high-quality data curation, diver…
☆254 · May 19, 2026 · Updated 2 months ago
RUC-NLPIR / DeepImageSearch
☆86 · May 2, 2026 · Updated 2 months ago
RUC-NLPIR / VideoDeepResearch
☆155 · Nov 17, 2025 · Updated 8 months ago
MasterVito / DAC-RL
Official Repo for DAC-RL: Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability
☆16 · Feb 26, 2026 · Updated 4 months ago
tulerfeng / Gen-Searcher
Gen-Searcher: Reinforcing Agentic Search for Image Generation
☆376 · Apr 7, 2026 · Updated 3 months ago
kxfan2002 / SophiaVL-R1
SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward
☆94 · Aug 8, 2025 · Updated 11 months ago
RUC-NLPIR / EnvScaler
The official implementation of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis".
☆176 · Feb 12, 2026 · Updated 5 months ago
RUC-NLPIR / GISA
GISA: A Benchmark for General Information-Seeking Assistant
☆36 · Mar 20, 2026 · Updated 4 months ago
RUC-NLPIR / SmartSearch
☆45 · Jan 19, 2026 · Updated 6 months ago
Osilly / Vision-DeepResearch
[ICML 2026] Multimodal deep-research MLLM and benchmark. The first long-horizon multimodal deep-research MLLM, extending the number of re…
☆657 · Jun 8, 2026 · Updated last month
tu-tuing / SlowBA
[🏆ECCV'26] Official Repo for SlowBA: An efficiency backdoor attack towards VLM-based GUI agents
☆15 · Jul 1, 2026 · Updated 3 weeks ago
TianHongZXY / RLVR-Decomposed
[NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"
☆165 · Mar 2, 2026 · Updated 4 months ago
plageon / HierSearch
HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches
☆40 · Oct 9, 2025 · Updated 9 months ago
zlab-princeton / VisionFoundry
VisionFoundry: Teaching VLMs Visual Perception with Synthetic Images
☆52 · Apr 28, 2026 · Updated 2 months ago
ClawGym / ClawGym-Agents
☆33 · Jun 30, 2026 · Updated 3 weeks ago
haon-chen / MoCa
☆68 · Aug 14, 2025 · Updated 11 months ago
X-LANCE / text2sql-multiturn-GPT
[NAACL 2024] CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions
☆13 · May 7, 2024 · Updated 2 years ago
Yuqi-Zhou / LRAT
The implementation for SIGIR 2026: Learning to Retrieve from Agent Trajectories.
☆55 · Jul 14, 2026 · Updated last week
RUC-NLPIR / iAgent
Including 12+ cutting-edge agent systems across multiple research directions
☆35 · Nov 10, 2025 · Updated 8 months ago
microsoft / Text2Grad
🚀 Text2Grad: Converting natural language feedback into gradient signals for precise model optimization. Revolutionizing RLHF with span-l…
☆37 · Feb 6, 2026 · Updated 5 months ago
CLR-Lab / SimKO
SimKO: Simple Pass@K Policy Optimization
☆31 · Oct 24, 2025 · Updated 8 months ago
liushulinle / UloRL
An Ultra-Long Output Reinforcement Learning Approach
☆23 · Jul 31, 2025 · Updated 11 months ago
Zanette-Labs / speed-rl
☆18 · Feb 2, 2026 · Updated 5 months ago
ZJU-REAL / SkillZero
Official code for "SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization"
☆354 · Updated this week
plageon / MemSifter
MemSifter: Offloading LLM Memory Retrieval via Outcome-Driven Proxy Reasoning
☆68 · Jun 14, 2026 · Updated last month
GuoqingWang1 / IGPO
[ICLR 2026] Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents
☆128 · Jul 14, 2026 · Updated last week