allenai / asta-benchLinks
☆37Updated last week
Alternatives and similar repositories for asta-bench
Users that are interested in asta-bench are comparing it to the libraries listed below
Sorting:
- Code for CTO: A Large Clinical Trial Outcome and QA Dataset☆26Updated 2 months ago
- Papers about scientific hypothesis generation with large language models (LLMs).☆75Updated 4 months ago
- ☆76Updated 2 weeks ago
- A curated list of LLM powered AI Agents in Biomedical Research. Medical Image Analysis, Multi-omics Genomics Analysis, Biomedical Scienti…☆55Updated 3 weeks ago
- A curated list of papers on LLMs and agents for scientific research and development☆76Updated 10 months ago
- LitQA Eval: A difficult set of scientific questions that require context of full-text research papers to answer☆42Updated 10 months ago
- Data from BioPlanner: Automatic Evaluation of LLMs on Protocol Planning in Biology paper☆26Updated last year
- Discovering Data-driven Hypotheses in the Wild☆114Updated 4 months ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆106Updated last month
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records☆109Updated 9 months ago
- Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty☆85Updated last year
- This repository contains ScholarQABench data and evaluation pipeline.☆85Updated 2 months ago
- Automated Hypothesis Testing with Agentic Sequential Falsifications☆229Updated 5 months ago
- Evaluation dataset for AI systems intended to benchmark capabilities foundational to scientific research in biology☆88Updated 3 weeks ago
- Call for participation in the impact of LLM for scientific discovery☆73Updated last year
- CUREBench @ NeurIPS 2025: Benchmarking AI reasoning for therapeutic decision-making at scale☆117Updated 2 weeks ago
- [ICLR 2025] <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses>☆48Updated 2 weeks ago
- Structured Chemistry Reasoning with Large Language Models☆38Updated last year
- NeurIPS'24 DB (Spotlight) | Instruction Tuning Large Language Models to Understand Electronic Health Records☆45Updated last month
- Biomedical Question Answering Datasets.☆114Updated 5 months ago
- [ICLR 2025]ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning https://arxiv.org/abs/2501.06590☆72Updated 2 months ago
- An aviary-based data science agent based on jupyter notebooks☆38Updated 3 weeks ago
- ☆16Updated 2 months ago
- ☆32Updated 8 months ago
- ☆49Updated last year
- Framework enabling modular interchange of language agents, environments, and optimizers☆109Updated this week
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆119Updated last year
- Dataset for Checking Consistency between Unstructured Notes and Structured Tables in Electronic Health Records☆23Updated last year
- This is the official repository for HypoGeniC (Hypothesis Generation in Context) and HypoRefine, which are automated, data-driven tools t…☆89Updated 3 weeks ago
- ☆34Updated 5 months ago