ADaM-BJTU / model-native-agentic-aiLinks
Our survey's paper list on Agentic AI, continuously updated with the latest research.
☆70Updated last month
Alternatives and similar repositories for model-native-agentic-ai
Users that are interested in model-native-agentic-ai are comparing it to the libraries listed below
Sorting:
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated 2 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆51Updated 2 months ago
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models (NeurIPS 2025)☆167Updated 3 weeks ago
- VeriGUI: Verifiable Long-Chain GUI Dataset☆82Updated last month
- ☆32Updated 4 months ago
- Official repository of the video reasoning benchmark MMR-V. Can Your MLLMs "Think with Video"?☆36Updated 5 months ago
- ☆22Updated 6 months ago
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆86Updated 5 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆36Updated 10 months ago
- Official Code for "Learning to Reason via Mixture-of-Thought for Logical Reasoning"☆25Updated last week
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆121Updated 4 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆79Updated 3 weeks ago
- SFT+RL boosts multimodal reasoning☆37Updated 5 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆70Updated 5 months ago
- Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆79Updated 4 months ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training☆43Updated 3 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆151Updated 5 months ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Updated 2 months ago
- ☆168Updated last month
- ☆38Updated 3 months ago
- ☆123Updated last week
- "what, how, where, and how well? a survey on test-time scaling in large language models" repository☆77Updated last week
- Data and Code for CVPR 2025 paper "MMVU: Measuring Expert-Level Multi-Discipline Video Understanding"☆75Updated 9 months ago
- ☆61Updated 2 months ago
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward☆87Updated 3 months ago
- ☆15Updated 5 months ago
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too…☆353Updated 3 months ago
- ☆102Updated 10 months ago
- [NeurIPS 2025] Thinkless: LLM Learns When to Think☆243Updated 2 months ago
- ☆30Updated last week