A curated list of papers and resources based on the survey "Agentic Reasoning for Large Language Models"
☆1,211Mar 9, 2026Updated last month
Alternatives and similar repositories for Awesome-Agentic-Reasoning
Users that are interested in Awesome-Agentic-Reasoning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- High-performance React virtualized diff viewer for large code / text comparison☆33Apr 28, 2026Updated last week
- Multi-step reasoning MLLM☆22Mar 8, 2026Updated last month
- Aligning Agentic World Models via Knowledgeable Experience Learning☆32Jan 25, 2026Updated 3 months ago
- Official implementation of "Diffusion Language Models Know the Answer Before Decoding"☆56Updated this week
- Source code for paper "ATP: AMRize Than Parse! Enhancing AMR Parsing with PseudoAMRs" @NAACL-2022☆15Mar 31, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…☆46Nov 6, 2025Updated 5 months ago
- [EMNLP25 Main]The official code of "Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval"☆26Mar 30, 2026Updated last month
- Temporal Graph Rewiring Method with Expander Graphs☆12Oct 18, 2024Updated last year
- [ACL'25] Code for "Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering"☆21Jul 23, 2025Updated 9 months ago
- PyTorch KoBART/DistilKoBART Application☆14Oct 10, 2022Updated 3 years ago
- [ACL'25 Main] SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence! | 让你的LLM更好地利用上下文文档:一个基于注意力的简单方案☆28Feb 17, 2025Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆63Mar 17, 2026Updated last month
- Reinforcement Learning via Self-Distillation (SDPO)☆822Feb 18, 2026Updated 2 months ago
- Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"☆134Feb 4, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"☆20Mar 31, 2025Updated last year
- Generate Game Character for animation (SSD)☆35Mar 16, 2025Updated last year
- [ICLR 2026] [NeurIPS 2025] ViPRA: Video Prediction for Robot Actions☆44Jan 27, 2026Updated 3 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆564Sep 8, 2025Updated 7 months ago
- [ArXiv 26] The official repository of "ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors".☆33Mar 5, 2026Updated 2 months ago
- Defeating the Training-Inference Mismatch via FP16☆192Nov 14, 2025Updated 5 months ago
- Green-VLA: Staged Vision-Language-Action Model for Generalist Robots☆125Mar 5, 2026Updated 2 months ago
- M³: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM☆68Mar 18, 2026Updated last month
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward☆44Nov 18, 2025Updated 5 months ago
- Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment☆41Mar 24, 2026Updated last month
- ☆21Apr 17, 2023Updated 3 years ago
- [Survey] A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems☆2,096Oct 11, 2025Updated 6 months ago
- MemEvolve & EvolveLab☆211Dec 23, 2025Updated 4 months ago
- Benchmarking Optimizers for LLM Pretraining☆57Dec 30, 2025Updated 4 months ago
- transformer which using numpy,vision transformer of VIT, MNIST testset precision = 97.2%,mutil-attention, patch embed, position embed, fu…☆12Mar 4, 2026Updated 2 months ago
- ☆13Jul 16, 2023Updated 2 years ago
- A curated list of resources for "Flow Matching Meets Biology and Life Science: A Survey". Nature Portfolio Journal Artificial Intelligenc…☆80Mar 7, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆33Apr 1, 2026Updated last month
- Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images☆59Nov 4, 2025Updated 6 months ago
- [ACL 2026] R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning☆29Jan 4, 2026Updated 4 months ago
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆54Jul 15, 2025Updated 9 months ago
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…☆18Mar 25, 2024Updated 2 years ago
- [CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"☆55Mar 13, 2026Updated last month
- In-Context Reinforcement Learning for Tool Use in Large Language Models☆46Mar 26, 2026Updated last month