mims-harvard / CUREBenchLinks
CUREBench @ NeurIPS 2025: Benchmarking AI reasoning for therapeutic decision-making at scale
☆104Updated 2 weeks ago
Alternatives and similar repositories for CUREBench
Users that are interested in CUREBench are comparing it to the libraries listed below
Sorting:
- ToolUniverse is a collection of biomedical tools designed for AI agents☆214Updated 2 months ago
- ICLR'24 | BioBridge: Bridging Biomedical Foundation Models via Knowledge Graphs☆74Updated last year
- A curated list of LLM powered AI Agents in Biomedical Research. Medical Image Analysis, Multi-omics Genomics Analysis, Biomedical Scienti…☆53Updated 9 months ago
- [ICML'25] MedTok: Multimodal Medical Code Tokenizer☆26Updated 2 months ago
- MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning☆57Updated last month
- A specialized LLM for study search, study screening, and data extraction from medical literature.☆12Updated 6 months ago
- [NeurIPS 2023] Official codes of "MuSe-GNN: Learning Unified Gene Representation From Multimodal Biological Graph Data"☆30Updated 2 months ago
- BioKGBench: A Knowledge Graph Checking Benchmark of AI Agent for Biomedical Science☆21Updated 11 months ago
- ☆48Updated 7 months ago
- KGARevion: AI Agent for Knowledge-Intensive Biomedical QA☆36Updated 6 months ago
- ☆43Updated 10 months ago
- m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models☆42Updated 5 months ago
- BPfold: Deep generalizable prediction of RNA secondary structure via base pair motif energy.☆20Updated last week
- GenoTEX: A multi-task benchmark for LLM agentic methods in automated gene expression data analysis.☆24Updated 5 months ago
- BioDiscoveryAgent is an LLM-based AI agent for closed-loop design of genetic perturbation experiments☆87Updated 2 months ago
- Awesome-Biomolecule-Language-Cross-Modeling: a curated list of resources for paper "Leveraging Biomolecule and Natural Language through M…☆224Updated 5 months ago
- [COLM 2024] Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation☆14Updated last year
- Code and data for Cell-o1.☆24Updated last week
- Paper list of agent for science☆106Updated 2 weeks ago
- A collection of resources and papers on AI Scientist / Robot Scientist☆95Updated 3 months ago
- [CVPR 2025] BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature☆83Updated 6 months ago
- [NeurIPS 2024] BEACON: Benchmark for Comprehensive RNA Tasks and Language Models☆50Updated last year
- ☆31Updated 5 months ago
- [COLM'24] We propose Protein Chain of Thought (ProCoT), which replicates the biological mechanism of signaling pathways as language promp…☆66Updated 6 months ago
- MedMax: Mixed-Modal Instruction Tuning for Training Biomedical Assistants☆37Updated last week
- ChatCell: Facilitating Single-Cell Analysis with Natural Language☆52Updated 3 months ago
- Expert-level AI radiology report evaluator☆34Updated 5 months ago
- Repo for the pape Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions☆41Updated 2 months ago
- Official implementation for the paper "Toward Scientific Reasoning in LLMs: Training from Expert Discussions via Reinforcement Learning"☆51Updated 3 months ago
- ☆35Updated 2 years ago