mims-harvard / CUREBenchLinks
CUREBench @ NeurIPS 2025: Benchmarking AI reasoning for therapeutic decision-making at scale
☆114Updated 2 weeks ago
Alternatives and similar repositories for CUREBench
Users that are interested in CUREBench are comparing it to the libraries listed below
Sorting:
- ICLR'24 | BioBridge: Bridging Biomedical Foundation Models via Knowledge Graphs☆76Updated last year
- A curated list of LLM powered AI Agents in Biomedical Research. Medical Image Analysis, Multi-omics Genomics Analysis, Biomedical Scienti…☆57Updated last month
- Democratizing AI scientists with ToolUniverse☆629Updated last week
- BioKGBench: A Knowledge Graph Checking Benchmark of AI Agent for Biomedical Science☆22Updated last year
- [NeurIPS 2023] Official codes of "MuSe-GNN: Learning Unified Gene Representation From Multimodal Biological Graph Data"☆30Updated 4 months ago
- MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning☆63Updated last month
- [NeurIPS 2024] BEACON: Benchmark for Comprehensive RNA Tasks and Language Models☆53Updated last year
- A specialized LLM for study search, study screening, and data extraction from medical literature.☆21Updated 7 months ago
- BioDiscoveryAgent is an LLM-based AI agent for closed-loop design of genetic perturbation experiments☆90Updated 4 months ago
- [COLM 2024] Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation☆14Updated last year
- ☆43Updated 11 months ago
- ☆19Updated 8 months ago
- [COLM'24] We propose Protein Chain of Thought (ProCoT), which replicates the biological mechanism of signaling pathways as language promp…☆69Updated 7 months ago
- Must-read papers on AI for Biology☆21Updated 2 years ago
- Code and data for Cell-o1.☆25Updated last month
- Awesome-Biomolecule-Language-Cross-Modeling: a curated list of resources for paper "Leveraging Biomolecule and Natural Language through M…☆227Updated last week
- ☆110Updated last month
- [ICML'25] MedTok: Multimodal Medical Code Tokenizer☆32Updated 3 months ago
- Large Language Models in Protein: A Comprehensive Survey☆144Updated 7 months ago
- A virtual lab of LLM agents for science research☆526Updated 3 months ago
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes [EMNLP 2024]☆28Updated 11 months ago
- STELLA: Self-Evolving LLM Agent for Biomedical Research☆57Updated last month
- Code for CTO: A Large Clinical Trial Outcome and QA Dataset☆28Updated this week
- ChatCell: Facilitating Single-Cell Analysis with Natural Language☆52Updated 5 months ago
- ☆35Updated 2 years ago
- KGARevion: AI Agent for Knowledge-Intensive Biomedical QA☆38Updated 8 months ago
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆120Updated last year
- Official implementation of Panacea: A foundation model for clinical trial design, recruitment, search, and summarization.☆15Updated 10 months ago
- ☆29Updated last month
- ☆44Updated 6 months ago