KatherLab / ToolMaker
Turn GitHub repositories into LLM tools. (ACL 2025)
☆66Updated 8 months ago
Alternatives and similar repositories for ToolMaker
Users who are interested in ToolMaker are comparing it to the repositories listed below.
- MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs☆253Updated 7 months ago
- Repository for Zochi's Research☆298Updated 2 months ago
- A virtual clinical environment for self‑evolving LLM diagnostic agents.☆92Updated 2 months ago
- Towards Medical Small Language Models with Self-Evolved Slow Thinking☆87Updated 2 months ago
- This is the official repository for HypoGeniC (Hypothesis Generation in Context) and HypoRefine, which are automated, data-driven tools t…☆102Updated 2 months ago
- [arxiv'25] MedAgentGYM: Training LLM Agents for Code-Based Medical Reasoning at Scale☆76Updated this week
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆177Updated 3 months ago
- Agent benchmark for medical diagnosis☆277Updated last year
- (ACL 2025 Main) Code for MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…☆213Updated 3 months ago
- MedAgentSim: Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions, MICCAI 2025 (oral and early accepted)☆122Updated this week
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆123Updated 5 months ago
- [ACL 2025] Multi-Agent System for Science of Science☆65Updated 6 months ago
- Top papers related to LLM-based agent evaluation☆89Updated 3 months ago
- ☆328Updated 6 months ago
- ☆40Updated last week
- Discovering Data-driven Hypotheses in the Wild☆128Updated 7 months ago
- LLM for Scientific Research Survey☆123Updated last year
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆94Updated 2 months ago
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records☆121Updated last year
- MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents☆216Updated 2 months ago
- Official repository for RAG-Gym☆120Updated 11 months ago
- ☆48Updated 11 months ago
- Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search"☆86Updated 3 months ago
- ☆40Updated 8 months ago
- Optimize Any User-defined Compound AI Systems☆66Updated 5 months ago
- MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning☆73Updated 3 months ago
- [ICLR 2025] DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆102Updated 5 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆167Updated 3 months ago
- Official implementation of MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems☆73Updated 7 months ago
- A curated list of papers on LLMs and agents for scientific research and development☆84Updated last year