YsTvT/Awesome-Agentic-RL-Papers

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YsTvT/Awesome-Agentic-RL-Papers)

YsTvT / Awesome-Agentic-RL-Papers

☆107

Alternatives and similar repositories for Awesome-Agentic-RL-Papers

Users that are interested in Awesome-Agentic-RL-Papers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

harkerhand / seu_utils
View on GitHub
适用于东南大学学生的工具集合
☆16Sep 10, 2025Updated 10 months ago
HHHHHejia / Awesome-AgenticLLM-RL-Papers
View on GitHub
☆1,846Jun 18, 2026Updated last month
jingyingma01 / CodeBrain
View on GitHub
[ICLR'26] CodeBrain: Bridging Decoupled Tokenizer and Multi-Scale Architecture for EEG Foundation Models
☆25May 19, 2026Updated 2 months ago
ZhengLi2004 / SEU-Graduation-Thesis-Template
View on GitHub
☆27May 11, 2026Updated 2 months ago
shaohao011 / MedCCO
View on GitHub
[ACM MM2026] This is the official implementation of MedCCO
☆17Jul 12, 2026Updated last week
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
shiqichen17 / SPA
View on GitHub
Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"
☆36Nov 1, 2025Updated 8 months ago
Cassie07 / AgentSkill_Survey
View on GitHub
Agent Skill Evaluation and Evolution: Frameworks and Benchmarks
☆25Jul 15, 2026Updated last week
Yuancheng-Xu / GenARM
View on GitHub
Code for ICLR 2025 Paper "GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment"
☆24Feb 10, 2025Updated last year
Kamichanw / SEU-Course-Homework
View on GitHub
东南大学计软智部分课程作业。你的时间值得更有价值的事。
☆180Jun 7, 2025Updated last year
liangxinlizi / SEU-CSE-COURSERESOURCES
View on GitHub
☆17Feb 23, 2025Updated last year
exoskeletonzj / MARS
View on GitHub
A Multi-Agent Approach Integrating Socratic Guidance for Automated Prompt Optimization
☆18Dec 15, 2025Updated 7 months ago
ventr1c / Awesome-RL-based-Agentic-Search-Papers
View on GitHub
The official repository of "A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Eva…
☆279Updated this week
K1XE / InterviewForge
View on GitHub
Local-first interview recording review reports with a Codex skill and CLI.
☆76May 16, 2026Updated 2 months ago
thomasyyyoung / ToxiBenchCN
View on GitHub
[ACL-2025-Findings] The official GitHub repo for the paper "Exploring Multimodal Challenges in Toxic Chinese Detection: Taxonomy, Benchma…
☆23Jun 8, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
ZichenWen1 / EPIC
View on GitHub
(NeurIPS 2025 🔥) Official implementation for "Efficient Multi-modal Large Language Models via Progressive Consistency Distillation"
☆49Feb 11, 2026Updated 5 months ago
Qsingle / open-medical-r1
View on GitHub
This repository is aim to reproduce the R1-Zero on medical domain.
☆32Jun 11, 2025Updated last year
RockyChen0205 / STGE-Former
View on GitHub
Spatial-Temporal Graph-Enhanced Transformer for EEG Based Major Depressive Disorder Detection
☆22Feb 8, 2026Updated 5 months ago
Tree-Shu-Zhao / ferret
View on GitHub
An extensible RL framework for training LLM agents with advanced search capabilities, built on VERL and supporting state-of-the-art searc…
☆35Dec 1, 2025Updated 7 months ago
DeepReasoning / DeepMedix-R1
View on GitHub
☆63Sep 3, 2025Updated 10 months ago
ZJU-ACES-ISE / ChatUITest
View on GitHub
Under construction
☆14Jan 15, 2025Updated last year
ulab-uiuc / AgentProtocols
View on GitHub
Opensource code for ICML 2026 poster
☆15Nov 26, 2025Updated 7 months ago
dykang / xslue
View on GitHub
ACL 2021 paper "Style is NOT a single variable: Case Studies for Cross-Style Language Understanding " by Dongyeop Kang and Eduard Hovy
☆15Jul 19, 2021Updated 5 years ago
WalkerMitty / Fast-Llama2
View on GitHub
Fast instruction tuning with Llama2
☆11Apr 8, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
allenai / olmix
View on GitHub
☆41May 26, 2026Updated 2 months ago
multimodalpragmatic / multimodalpragmatic
View on GitHub
☆14Jan 14, 2026Updated 6 months ago
leostudiooo / GOOSE
View on GitHub
GOOSE Opens workOut for SEU undErgraduates
☆21Jan 28, 2026Updated 5 months ago
idoru / openclaw-rl
View on GitHub
OpenClaw-RL: Personalize openclaw simply by talking to it
☆16Feb 26, 2026Updated 4 months ago
ai4nucleome / BioMaster
View on GitHub
☆102Jul 14, 2026Updated last week
RUC-NLPIR / ARPO
View on GitHub
[ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)
☆1,092Jul 13, 2026Updated last week
malthee / evolutionary-diffusion
View on GitHub
Applying Evolutionary Computing to Embeddings of Diffusion Models
☆16Jun 6, 2026Updated last month
Applied-Machine-Learning-Lab / GAVE
View on GitHub
☆66Jul 12, 2025Updated last year
jiahao-shao1 / openclaw-setup
View on GitHub
☆16Mar 8, 2026Updated 4 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
ParrotClever / DL_point
View on GitHub
☆12Jan 9, 2025Updated last year
xufangzhi / Logiformer
View on GitHub
[SIGIR 2022] The implementation of Logiformer
☆28Jan 11, 2024Updated 2 years ago
feipiao594 / RopUI
View on GitHub
Personal project about cpp multiplatform ui framework.
☆15Mar 8, 2026Updated 4 months ago
seketeam / group-meeting-slides
View on GitHub
☆14Jun 3, 2025Updated last year
PeterGriffinJin / Search-R1
View on GitHub
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
☆5,153Nov 13, 2025Updated 8 months ago
1229095296 / ResRL
View on GitHub
This repository includes code for our paper: ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning…
☆15May 2, 2026Updated 2 months ago
GAIR-NLP / self-improvement-reversal
View on GitHub
☆13Jul 14, 2024Updated 2 years ago