wshi83 / MedAgentGymLinks
This is the official repository for paper "MedAgentGYM: Training LLM Agents for Code-Based Medical Reasoning at Scale"
☆20Updated last week
Alternatives and similar repositories for MedAgentGym
Users that are interested in MedAgentGym are comparing it to the libraries listed below
Sorting:
- m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆33Updated 2 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆46Updated last month
- ☆48Updated 3 months ago
- ☆27Updated 4 months ago
- ☆37Updated 5 months ago
- Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆35Updated last week
- MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning☆45Updated this week
- The official implementation of SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning☆15Updated last month
- ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning☆59Updated last week
- MC-CoT implementation code☆16Updated 7 months ago
- [ACL 2025] Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging☆36Updated 3 weeks ago
- MedMax: Mixed-Modal Instruction Tuning for Training Biomedical Assistants☆35Updated last month
- Code, Data and Model for Paper "Learning from Peers in Reasoning Models"☆24Updated last month
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…☆30Updated 3 weeks ago
- [CVPR 2025] MicroVQA eval and 🤖RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"…☆21Updated 3 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆72Updated 3 months ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆57Updated 8 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆108Updated 3 weeks ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆62Updated 3 weeks ago
- ☆36Updated 2 weeks ago
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024]☆59Updated 5 months ago
- ☆16Updated 2 months ago
- [ACL 2025 Findings] "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"☆19Updated 4 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆27Updated last month
- Preference Learning for LLaVA☆46Updated 7 months ago
- Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards☆26Updated last week
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆68Updated last year
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆42Updated 8 months ago
- Codes for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding☆35Updated last month
- This the implementation of LeCo☆31Updated 5 months ago