ChengpengLi1003 / Awesome-Long-Chain-of-Thought-Reasoning-with-tools
A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.
☆45 · Dec 17, 2025 · Updated 2 months ago
Alternatives and similar repositories for Awesome-Long-Chain-of-Thought-Reasoning-with-tools
Users who are interested in Awesome-Long-Chain-of-Thought-Reasoning-with-tools are comparing it to the repositories listed below
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment ☆16 · Dec 19, 2024 · Updated last year
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent ☆69 · May 13, 2025 · Updated 9 months ago
- Recent Advances on MLLM's Reasoning Ability ☆26 · Apr 11, 2025 · Updated 10 months ago
- [ACL 2025] Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging ☆38 · Jun 4, 2025 · Updated 8 months ago
- RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024 ☆90 · Sep 30, 2024 · Updated last year
- ☆10 · Jul 13, 2024 · Updated last year
- ☆13 · Sep 23, 2022 · Updated 3 years ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning ☆72 · May 25, 2025 · Updated 8 months ago
- ☆11 · May 17, 2024 · Updated last year
- The official implementation of "Enhancing Representation in Radiography-Reports Foundation Model: A Granular Alignment Algorithm Using Ma… ☆13 · Sep 13, 2024 · Updated last year
- A collection of research on specialized medical LLMs for specific diseases and distinct medical specialties, organized by ICD-10 chapters… ☆30 · Oct 10, 2025 · Updated 4 months ago
- ☆11 · Jun 21, 2025 · Updated 7 months ago
- Code of LeCoRE ☆13 · Feb 15, 2023 · Updated 3 years ago
- The official repository for the CodeGym project: "Generalizable End-to-End Tool-Use RL with Synthetic CodeGym" ☆22 · Oct 14, 2025 · Updated 4 months ago
- ☆10 · Sep 18, 2021 · Updated 4 years ago
- AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation (EMNLP 2024 Findings) ☆15 · Dec 30, 2024 · Updated last year
- A mobile GUI search engine using a vision-language model ☆14 · May 5, 2025 · Updated 9 months ago
- Hands-on repository for fine-tuning Large Language Models (LLMs) in the clinical domain with tutorials ☆13 · Jan 9, 2026 · Updated last month
- [2025 ACL Findings] Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization ☆25 · Oct 29, 2025 · Updated 3 months ago
- Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main) ☆19 · Jul 19, 2025 · Updated 6 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆53 · Jun 6, 2025 · Updated 8 months ago
- **ASCM4ABSA** - Our code and proposed data for NLPCC 2022 paper titled "Aspect-specific Context Modeling for Aspect-based Sentiment Analy… ☆12 · Mar 26, 2023 · Updated 2 years ago
- [ACM MM 2025 🔥🔥 ] MIRA: A first-of-its-kind medical RAG framework that fuses image features and retrieved knowledge with dynamic contex… ☆18 · Aug 28, 2025 · Updated 5 months ago
- ☆10 · Nov 14, 2021 · Updated 4 years ago
- The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge… ☆11 · Jul 28, 2025 · Updated 6 months ago
- Repository for the paper: Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning ☆17 · Feb 21, 2025 · Updated 11 months ago
- Automatic Classification of Human Body Parts from X-ray Images Using Deep Convolutional Neural Networks ☆12 · Aug 12, 2021 · Updated 4 years ago
- 🩻 NV-Reason-CXR-3B is a specialized vision-language model designed for medical reasoning and interpretation of chest X-ray images. ☆42 · Oct 29, 2025 · Updated 3 months ago
- ☆17 · Sep 18, 2025 · Updated 4 months ago
- Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning (releases the dataset and the model weights) ☆13 · May 26, 2025 · Updated 8 months ago
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin… ☆13 · Jul 27, 2025 · Updated 6 months ago
- ☆11 · Aug 10, 2022 · Updated 3 years ago
- [NeurIPS25] RULE: Reinforcement UnLEarning Achieves Forget-retain Pareto Optimality ☆19 · Oct 22, 2025 · Updated 3 months ago
- ☆10 · Jan 1, 2022 · Updated 4 years ago
- Under construction ☆13 · Jan 15, 2025 · Updated last year
- ☆25 · Sep 18, 2025 · Updated 5 months ago
- [NeurIPS 2024] Beyond Single Stationary Policies: Meta-Task Players as Naturally Superior Collaborators ☆16 · Nov 15, 2024 · Updated last year
- HeartBench is an evaluation benchmark for the psychological and social sciences field, designed to transcend traditional knowledge and re… ☆28 · Jan 7, 2026 · Updated last month
- Rethinking Whole-Body CT Image Interpretation: An Abnormality-Centric Approach ☆19 · Nov 17, 2025 · Updated 3 months ago