mims-harvard / CUREBenchLinks
CUREBench @ NeurIPS 2025: Benchmarking AI reasoning for therapeutic decision-making at scale
☆51Updated last week
Alternatives and similar repositories for CUREBench
Users that are interested in CUREBench are comparing it to the libraries listed below
Sorting:
- [ICML'25] MedTok: Multimodal Medical Code Tokenizer☆16Updated this week
- MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning☆47Updated 3 weeks ago
- ToolUniverse is a collection of biomedical tools designed for AI agents☆177Updated last week
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes [EMNLP 2024]☆28Updated 8 months ago
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension☆45Updated 7 months ago
- [NIPS2023]Implementation of Foundation Model is Efficient Multimodal Multitask Model Selector☆37Updated last year
- ☆51Updated 6 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆46Updated 2 months ago
- Code for ICLR'24 workshop ME-FoMo-How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation☆37Updated 9 months ago
- (ICML 2023) Discover and Cure: Concept-aware Mitigation of Spurious Correlation☆41Updated last year
- ☆48Updated 5 months ago
- m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆38Updated 3 months ago
- MedMax: Mixed-Modal Instruction Tuning for Training Biomedical Assistants☆36Updated 2 months ago
- ICLR'24 | BioBridge: Bridging Biomedical Foundation Models via Knowledge Graphs☆74Updated last year
- The efficient tuning method for VLMs☆80Updated last year
- Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning (ICML 2023)☆18Updated last year
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts"☆59Updated last year
- Self-training LLaVA for medical☆16Updated 8 months ago
- ☆43Updated 8 months ago
- [CVPR 2025] MicroVQA eval and 🤖RefineBot code for "MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research"…☆21Updated 2 weeks ago
- Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation"☆27Updated 3 months ago
- ☆13Updated 9 months ago
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.☆33Updated 2 years ago
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…☆32Updated 2 months ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆69Updated last year
- I2M2: Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning (NeurIPS 2024)☆20Updated 8 months ago
- [ECCV 2024] FlexAttention for Efficient High-Resolution Vision-Language Models☆41Updated 6 months ago
- [CVPR 2025] BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature☆74Updated 4 months ago
- Collection of latest papers and materials in the area of RLVR!☆17Updated last month
- [ACL2025] Unsolvable Problem Detection: Robust Understanding Evaluation for Large Multimodal Models☆77Updated last month