allenai / asta-bench
☆44Updated last week
Alternatives and similar repositories for asta-bench
Users who are interested in asta-bench are comparing it to the libraries listed below.
- ☆86Updated 2 weeks ago
- A curated list of papers on LLMs and agents for scientific research and development☆77Updated 11 months ago
- Papers about scientific hypothesis generation with large language models (LLMs).☆76Updated 5 months ago
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024]☆69Updated 10 months ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆109Updated 2 months ago
- Discovering Data-driven Hypotheses in the Wild☆118Updated 5 months ago
- Framework enabling modular interchange of language agents, environments, and optimizers☆114Updated this week
- This repository contains ScholarQABench data and evaluation pipeline.☆85Updated 3 months ago
- Data from BioPlanner: Automatic Evaluation of LLMs on Protocol Planning in Biology paper☆26Updated last year
- A language agent gym with challenging scientific tasks☆212Updated 2 weeks ago
- ☆35Updated 5 months ago
- A benchmark that challenges language models to code solutions for scientific problems☆153Updated last week
- Structured Chemistry Reasoning with Large Language Models☆39Updated last year
- Optimize Any User-defined Compound AI Systems☆62Updated 3 months ago
- LitQA Eval: A difficult set of scientific questions that require context of full-text research papers to answer☆43Updated 11 months ago
- Automated Hypothesis Testing with Agentic Sequential Falsifications☆231Updated 6 months ago
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".☆218Updated 2 weeks ago
- LLM for Scientific Research Survey☆113Updated 9 months ago
- Biomedical Question Answering Datasets.☆119Updated 6 months ago
- A curated list of LLM powered AI Agents in Biomedical Research. Medical Image Analysis, Multi-omics Genomics Analysis, Biomedical Scienti…☆59Updated last month
- A collection of resources and papers on AI Scientist / Robot Scientist☆107Updated last month
- ☆222Updated 8 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆42Updated last year
- This is the official repository for HypoGeniC (Hypothesis Generation in Context) and HypoRefine, which are automated, data-driven tools t…☆91Updated last week
- [EMNLP'24] EHRAgent: Code Empowers Large Language Models for Complex Tabular Reasoning on Electronic Health Records☆112Updated 10 months ago
- CiteME is a benchmark designed to test the abilities of language models in finding papers that are cited in scientific texts.☆48Updated 2 weeks ago
- Code for CTO: A Large Clinical Trial Outcome and QA Dataset☆28Updated 2 weeks ago
- Call for participation in the impact of LLM for scientific discovery☆73Updated last year
- [ICLR 2025] ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning https://arxiv.org/abs/2501.06590☆73Updated 3 months ago
- A virtual environment for developing and evaluating automated scientific discovery agents.☆190Updated 8 months ago