mims-harvard / CUREBenchLinks
CUREBench @ NeurIPS 2025: Benchmarking AI reasoning for therapeutic decision-making at scale
☆77Updated this week
Alternatives and similar repositories for CUREBench
Users that are interested in CUREBench are comparing it to the libraries listed below
Sorting:
- ToolUniverse is a collection of biomedical tools designed for AI agents☆198Updated last month
- ICLR'24 | BioBridge: Bridging Biomedical Foundation Models via Knowledge Graphs☆73Updated last year
- A curated list of LLM powered AI Agents in Biomedical Research. Medical Image Analysis, Multi-omics Genomics Analysis, Biomedical Scienti…☆46Updated 8 months ago
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes [EMNLP 2024]☆28Updated 8 months ago
- ☆43Updated 9 months ago
- Collection of latest papers and materials in the area of RLVR!☆22Updated 2 months ago
- A specialized LLM for study search, study screening, and data extraction from medical literature.☆12Updated 5 months ago
- BioKGBench: A Knowledge Graph Checking Benchmark of AI Agent for Biomedical Science☆20Updated 10 months ago
- [NeurIPS 2023] Official codes of "MuSe-GNN: Learning Unified Gene Representation From Multimodal Biological Graph Data"☆30Updated last month
- [ICML'25] MedTok: Multimodal Medical Code Tokenizer☆22Updated 3 weeks ago
- Must-read papers on AI for Biology☆19Updated last year
- [COLM'24] We propose Protein Chain of Thought (ProCoT), which replicates the biological mechanism of signaling pathways as language promp…☆67Updated 5 months ago
- [COLM 2024] Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation☆14Updated last year
- ☆48Updated 5 months ago
- KGARevion: AI Agent for Knowledge-Intensive Biomedical QA☆28Updated 5 months ago
- MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning https://arxiv.org/abs/2503.07459☆53Updated last week
- ☆34Updated 3 weeks ago
- A collection of resources and papers on AI Scientist / Robot Scientist☆89Updated 2 months ago
- (ICML 2023) Discover and Cure: Concept-aware Mitigation of Spurious Correlation☆41Updated last year
- MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension☆45Updated 8 months ago
- GenoTEX: A multi-task benchmark for LLM agentic methods in automated gene expression data analysis.☆23Updated 4 months ago
- InstructMol: Multi-Modal Integration for Building a Versatile and Reliable Molecular Assistant in Drug Discovery (COLING 2025)☆48Updated 8 months ago
- Code for CTO: A Large Clinical Trial Outcome and QA Dataset☆25Updated last week
- Multi-Agent System for Science of Science☆58Updated 3 weeks ago
- Call for participation in the impact of LLM for scientific discovery☆73Updated last year
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆83Updated 9 months ago
- m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆40Updated 4 months ago
- A toolkit for developing foundation models using Electronic Health Record (EHR) data.☆37Updated this week
- BioDiscoveryAgent is an LLM-based AI agent for closed-loop design of genetic perturbation experiments☆84Updated last month
- MedMax: Mixed-Modal Instruction Tuning for Training Biomedical Assistants☆36Updated 3 months ago