ChengpengLi1003/Awesome-Long-Chain-of-Thought-Reasoning-with-tools

ChengpengLi1003 / Awesome-Long-Chain-of-Thought-Reasoning-with-tools

A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.

☆46

Alternatives and similar repositories for Awesome-Long-Chain-of-Thought-Reasoning-with-tools

Users who are interested in Awesome-Long-Chain-of-Thought-Reasoning-with-tools are comparing it to the repositories listed below.

ChengpengLi1003 / CoRT
☆72 · Oct 23, 2025 · Updated 8 months ago
FreedomIntelligence / MyPhoneBench
MyPhoneBench: Do Phone-Use Agents Respect Your Privacy?
☆24 · Apr 3, 2026 · Updated 3 months ago
hzy312 / knowledge-r1
IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent
☆70 · May 13, 2025 · Updated last year
UmeanNever / RankSurprisalRatio
[ACL 2026 Main] Official Repo for Paper "Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Ali…
☆17 · Jul 1, 2026 · Updated 2 weeks ago
jinzhuoran / RAG-RewardBench
RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment
☆18 · Dec 19, 2024 · Updated last year
597358816 / AEPO
Arbitrary Entropy Policy Optimization: Entropy Is Controllable in Reinforcement Fine-tuning
☆17 · Jan 19, 2026 · Updated 6 months ago
RUCAIBox / R1-Searcher-plus
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
☆81 · May 25, 2025 · Updated last year
JackShDr / InfluentialRS
Implementations of Influential Recommender System
☆12 · Oct 29, 2024 · Updated last year
ernie-research / Tool-Augmented-Reward-Model
[ICLR'24 spotlight] Tool-Augmented Reward Modeling
☆54 · Jun 6, 2025 · Updated last year
1KE-JI / UPFT
Official resources of "The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reaso…
☆20 · Jun 13, 2025 · Updated last year
ydk122024 / Med-HallMark
Detecting and Evaluating Medical Hallucinations in Large Vision Language Models
☆14 · Jun 24, 2024 · Updated 2 years ago
ChengpengLi1003 / Q-learning
A reproduction of the classic tabular Q-learning algorithm, supporting most gym environments with discrete action and state spaces, such as CliffWalking-v0.
☆10 · Jan 2, 2021 · Updated 5 years ago
ChengpengLi1003 / DotaMath
☆30 · Dec 27, 2024 · Updated last year
TIMMY-CHAN / MISS
[ICANN 2024 (Oral)] MISS: A Generative Pre-training and Fine-tuning Approach for Med-VQA
☆12 · Aug 8, 2024 · Updated last year
1KE-JI / HierVerb
Official resources of "Hierarchical Verbalizer for Few-Shot Hierarchical Text Classification" (ACL 2023 long).
☆28 · Jul 30, 2023 · Updated 2 years ago
sheep333c / DIVE
DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use
☆26 · Mar 13, 2026 · Updated 4 months ago
Ruiyang-061X / Awesome-Search-RL
☆44 · Jun 10, 2025 · Updated last year
w-yibo / R1-Compress
[NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search
☆17 · Jan 24, 2026 · Updated 5 months ago
Victorwz / LaViA
☆10 · Jul 13, 2024 · Updated 2 years ago
FreedomIntelligence / SepsisAgent
Agentifying Patient Dynamics within LLMs through Interacting with Clinical World Model
☆30 · May 15, 2026 · Updated 2 months ago
Blueyee / Efficient-CoT-LRMs
Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!
☆71 · Apr 1, 2025 · Updated last year
FuCongResearchSquad / ManCAR
Official implementation of the KDD'26 paper "ManCAR: Manifold-Constrained Latent Reasoning with Adaptive Test-Time Computation for Sequen…
☆21 · May 28, 2026 · Updated last month
RUCKBReasoning / CoT-based-Synthesizer
Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis'
☆32 · May 19, 2025 · Updated last year
WillDreamer / Awesome-MLLM-Reasoning
Recent Advances on MLLM's Reasoning Ability
☆26 · Apr 11, 2025 · Updated last year
gaojingsheng / LiveChat
Code and Dataset for the paper "LiveChat: A Large-Scale Personalized Dialogue Dataset Automatically Constructed from Live Streaming" ACL …
☆38 · Aug 21, 2023 · Updated 2 years ago
LuoXiaoHeics / Continual-Tune
☆10 · Feb 6, 2025 · Updated last year
FreedomIntelligence / ApolloMoE
[ICLR'25] ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts
☆53 · Nov 20, 2024 · Updated last year
wlzhang2020 / ReasonRAG
Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning
☆46 · Jun 24, 2025 · Updated last year
UCDvision / PatchSearch
Code for the CVPR '23 paper, "Defending Against Patch-based Backdoor Attacks on Self-Supervised Learning"
☆10 · Jun 9, 2023 · Updated 3 years ago
TIMMY-CHAN / MILE
[MICCAI 2024] Can LLMs' Tuning Methods Work in Medical Multimodal Domain?
☆17 · Sep 18, 2024 · Updated last year
Fu-Dayuan / PreAct
PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)
☆31 · Dec 12, 2024 · Updated last year
mfshiu / kaqg
AI-Powered Assessment System with Knowledge Graphs and RAG
☆15 · Mar 8, 2026 · Updated 4 months ago
bobxwu / learning-from-rewards-llm-papers
A comprehensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…
☆73 · Jun 13, 2025 · Updated last year
analokmaus / kaggle-aimo2-fast-math-r1
Kaggle AIMO2 solution with token-efficient reasoning LLM recipes
☆50 · Aug 7, 2025 · Updated 11 months ago
zjunlp / TRICE
[NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback
☆43 · Mar 14, 2024 · Updated 2 years ago
Jl-wei / guing
A mobile GUI search engine using a vision-language model
☆15 · May 5, 2025 · Updated last year
NVIDIA / When2Call
A dataset for training and evaluating LLMs on decision making about "when (not) to call" functions
☆66 · Apr 29, 2025 · Updated last year
0russwest0 / Awesome-Agent-RL
☆511 · Oct 11, 2025 · Updated 9 months ago
Lux0926 / ASPRM
AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence
☆10 · Mar 2, 2025 · Updated last year