KatherLab / ToolMaker
Turn GitHub repositories into LLM tools. (ACL 2025)
☆46 Updated 4 months ago
Alternatives and similar repositories for ToolMaker
Users who are interested in ToolMaker are comparing it to the repositories listed below.
- MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs ☆226 Updated 3 months ago
- Top papers related to LLM-based agent evaluation ☆84 Updated 3 weeks ago
- ☆48 Updated 7 months ago
- Towards Medical Small Language Models with Self-Evolved Slow Thinking ☆81 Updated 4 months ago
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records ☆109 Updated 9 months ago
- ☆34 Updated 4 months ago
- A Comprehensive Rare Disease Diagnostic Dataset with nearly 50,000 patients covering more than 4000 diseases ☆16 Updated 5 months ago
- MedAgentSim: Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions, MICCAI 2025 (early accepted) ☆80 Updated 3 months ago
- ☆38 Updated 4 months ago
- MIRIAD is a million-scale Medical Instruction and RetrIeval Dataset ☆126 Updated last month
- This is the official repository for HypoGeniC (Hypothesis Generation in Context) and HypoRefine, which are automated, data-driven tools t… ☆87 Updated 2 weeks ago
- Agent benchmark for medical diagnosis ☆234 Updated 9 months ago
- Democratizing AI scientists with ToolUniverse ☆364 Updated this week
- ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning ☆71 Updated last month
- For Med-Gemini, we relabeled the MedQA benchmark; this repo includes the annotations and analysis code. ☆61 Updated last year
- m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models ☆42 Updated 5 months ago
- Medical Hallucination in Foundation Models and Their Impact on Healthcare (2025) ☆71 Updated 6 months ago
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref… ☆66 Updated 7 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains ☆48 Updated 5 months ago
- Discovering Data-driven Hypotheses in the Wild ☆113 Updated 4 months ago
- This repository contains ScholarQABench data and evaluation pipeline. ☆85 Updated last month
- Source code for the collaborative reasoner research project at Meta FAIR. ☆102 Updated 5 months ago
- ☆50 Updated 4 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning ☆69 Updated 2 months ago
- ☆28 Updated 7 months ago
- ☆40 Updated 4 months ago
- Optimize Any User-defined Compound AI Systems ☆49 Updated last month
- ☆30 Updated 8 months ago
- Repository for Zochi's Research ☆276 Updated last month
- [NeurIPS 2024 D&B Track, Spotlight] UltraMedical: Building Specialized Generalists in Biomedicine ☆92 Updated last year