KatherLab / ToolMaker
Turn GitHub repositories into LLM tools. (ACL 2025)
☆58Updated 7 months ago
Alternatives and similar repositories for ToolMaker
Users who are interested in ToolMaker are comparing it to the repositories listed below.
- MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs☆245Updated 6 months ago
- Agent benchmark for medical diagnosis☆265Updated 11 months ago
- MedAgentSim: Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions, MICCAI 2025 (oral and early accepted)☆104Updated 3 weeks ago
- A virtual clinical environment for self‑evolving LLM diagnostic agents.☆85Updated 2 weeks ago
- ☆48Updated 9 months ago
- [ACL 2025] Multi-Agent System for Science of Science☆64Updated 4 months ago
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records☆117Updated 11 months ago
- ☆37Updated 6 months ago
- MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning☆67Updated 2 months ago
- ☆38Updated 6 months ago
- MIRIAD is a million-scale Medical Instruction and Retrieval Dataset☆135Updated 3 weeks ago
- Repository for Zochi's Research☆295Updated last month
- MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents☆184Updated 3 weeks ago
- Medical Hallucination in Foundation Models and Their Impact on Healthcare (2025)☆76Updated last month
- Official repository of the MIRAGE benchmark☆185Updated last year
- This is the official repository for HypoGeniC (Hypothesis Generation in Context) and HypoRefine, which are automated, data-driven tools t…☆96Updated last month
- [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆47Updated 8 months ago
- This is the code of MMOA-RAG.☆94Updated 7 months ago
- Official Code Release for "Training a Generally Curious Agent"☆39Updated 7 months ago
- [arxiv'25] MedAgentGYM: Training LLM Agents for Code-Based Medical Reasoning at Scale☆70Updated 4 months ago
- A Comprehensive Rare Disease Diagnostic Dataset with nearly 50,000 patients covering more than 4000 diseases☆17Updated 7 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆136Updated last year
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆115Updated 3 months ago
- [NeurIPS 2024 Datasets and Benchmark Track Oral] MedCalc-Bench: Evaluating Large Language Models for Medical Calculations☆77Updated this week
- ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning☆103Updated last month
- Official repository for RAG-Gym☆117Updated 9 months ago
- Top papers related to LLM-based agent evaluation☆86Updated last month
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine☆94Updated last year
- A collection of resources and papers on AI Scientist / Robot Scientist☆116Updated 2 months ago
- Towards Medical Small Language Models with Self-Evolved Slow Thinking☆84Updated last month