mims-harvard / CUREBenchLinks
CUREBench @ NeurIPS 2025: Benchmarking AI reasoning for therapeutic decision-making at scale
☆114Updated last week
Alternatives and similar repositories for CUREBench
Users that are interested in CUREBench are comparing it to the libraries listed below
Sorting:
- Democratizing AI scientists with ToolUniverse☆453Updated this week
- ICLR'24 | BioBridge: Bridging Biomedical Foundation Models via Knowledge Graphs☆76Updated last year
- A curated list of LLM powered AI Agents in Biomedical Research. Medical Image Analysis, Multi-omics Genomics Analysis, Biomedical Scienti…☆55Updated 3 weeks ago
- MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning☆59Updated last week
- BioKGBench: A Knowledge Graph Checking Benchmark of AI Agent for Biomedical Science☆22Updated last year
- ☆43Updated 11 months ago
- [NeurIPS 2023] Official codes of "MuSe-GNN: Learning Unified Gene Representation From Multimodal Biological Graph Data"☆30Updated 3 months ago
- BioDiscoveryAgent is an LLM-based AI agent for closed-loop design of genetic perturbation experiments☆90Updated 3 months ago
- [COLM'24] We propose Protein Chain of Thought (ProCoT), which replicates the biological mechanism of signaling pathways as language promp…☆67Updated 7 months ago
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes [EMNLP 2024]☆28Updated 11 months ago
- [COLM 2024] Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation☆14Updated last year
- A specialized LLM for study search, study screening, and data extraction from medical literature.☆20Updated 7 months ago
- ☆35Updated 2 years ago
- Awesome-Biomolecule-Language-Cross-Modeling: a curated list of resources for paper "Leveraging Biomolecule and Natural Language through M…☆226Updated 6 months ago
- Code and data for Cell-o1.☆25Updated last month
- InstructMol: Multi-Modal Integration for Building a Versatile and Reliable Molecular Assistant in Drug Discovery (COLING 2025)☆50Updated 10 months ago
- Paper list of agent for science☆133Updated 2 weeks ago
- [NeurIPS 2024] BEACON: Benchmark for Comprehensive RNA Tasks and Language Models☆52Updated last year
- A collection of resources and papers on AI Scientist / Robot Scientist☆101Updated 2 weeks ago
- A virtual lab of LLM agents for science research☆500Updated 2 months ago
- ChatCell: Facilitating Single-Cell Analysis with Natural Language☆52Updated 4 months ago
- ☆48Updated 7 months ago
- [ICML'25] MedTok: Multimodal Medical Code Tokenizer☆28Updated 2 months ago
- ☆49Updated last year
- [ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models☆286Updated 11 months ago
- Must-read papers on AI for Biology☆19Updated 2 years ago
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆119Updated last year
- [EMNLP2024] Benchmark for "Large Language Models Are Poor Clinical Decision-Makers: A Comprehensive Benchmark"☆30Updated last month
- Repo for the pape Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions☆42Updated 3 months ago
- ☆97Updated 3 weeks ago