A curated list of papers and resources based on the survey "Agentic Reasoning for Large Language Models"
☆1,264Mar 9, 2026Updated 2 months ago
Alternatives and similar repositories for Awesome-Agentic-Reasoning
Users that are interested in Awesome-Agentic-Reasoning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality☆19Mar 4, 2025Updated last year
- Multi-step reasoning MLLM☆24Mar 8, 2026Updated 3 months ago
- Official implementation of "Diffusion Language Models Know the Answer Before Decoding"☆56Apr 28, 2026Updated last month
- Source code for paper "ATP: AMRize Than Parse! Enhancing AMR Parsing with PseudoAMRs" @NAACL-2022☆15Mar 31, 2023Updated 3 years ago
- Implementation of the MetaController proposed in "Emergent temporal abstractions in autoregressive models enable hierarchical reinforceme…☆105May 23, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…☆46Nov 6, 2025Updated 7 months ago
- [EMNLP25 Main]The official code of "Gradient-Attention Guided Dual-Masking Synergetic Framework for Robust Text-based Person Retrieval"☆25Mar 30, 2026Updated 2 months ago
- AI agent skill for writing senior-engineer quality code through SOLID principles, TDD, and clean architecture☆439Apr 13, 2026Updated last month
- [ACL'25] Code for "Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering"☆21Jul 23, 2025Updated 10 months ago
- PyTorch KoBART/DistilKoBART Application☆14Oct 10, 2022Updated 3 years ago
- [ACL'25 Main] SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence! | 让你的LLM更好地利用上下文文档:一个基于注意力的简单方案☆29Feb 17, 2025Updated last year
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆57Mar 12, 2026Updated 2 months ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆69Mar 17, 2026Updated 2 months ago
- [NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example☆437Mar 11, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Reinforcement Learning via Self-Distillation (SDPO)☆929Feb 18, 2026Updated 3 months ago
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"☆22Mar 31, 2025Updated last year
- Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"☆145Feb 4, 2026Updated 4 months ago
- Generate Game Character for animation (SSD)☆36Mar 16, 2025Updated last year
- [ICLR 2026] [NeurIPS 2025] ViPRA: Video Prediction for Robot Actions☆44Jan 27, 2026Updated 4 months ago
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆574Sep 8, 2025Updated 9 months ago
- Open Knowledge Graph Resources is a static, daily-refreshed catalog of ontology and semantic software records sourced from Wikidata. It p…☆58Updated this week
- Defeating the Training-Inference Mismatch via FP16☆193Nov 14, 2025Updated 6 months ago
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems☆24Jul 4, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ArXiv 26] The official repository of "ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors".☆37Mar 5, 2026Updated 3 months ago
- Green-VLA: Staged Vision-Language-Action Model for Generalist Robots☆134Mar 5, 2026Updated 3 months ago
- Unofficial implementation of 'Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator'☆10Dec 10, 2024Updated last year
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated 3 months ago
- Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward☆44Nov 18, 2025Updated 6 months ago
- Open-source toolkit for RAG chunking: convert Markdown, validate documents, visualize and optimize chunking strategies, and enrich result…☆106May 27, 2026Updated last week
- M³: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM☆73Mar 18, 2026Updated 2 months ago
- ☆21Apr 17, 2023Updated 3 years ago
- Curated plugin marketplace for AI agents - works with Claude Code, Codex, and openskills☆994May 12, 2026Updated 3 weeks ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [Survey] A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems☆2,203May 16, 2026Updated 3 weeks ago
- [ICML'26] MemEvolve & EvolveLab☆231May 5, 2026Updated last month
- transformer which using numpy,vision transformer of VIT, MNIST testset precision = 97.2%,mutil-attention, patch embed, position embed, fu…☆12Mar 4, 2026Updated 3 months ago
- ☆13Jul 16, 2023Updated 2 years ago
- Benchmarking Optimizers for LLM Pretraining☆60May 3, 2026Updated last month
- ☆33Apr 1, 2026Updated 2 months ago
- [ACL 2026] R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning☆33Jan 4, 2026Updated 5 months ago