allenai / asta-bench
☆53 Updated 3 weeks ago
Alternatives and similar repositories for asta-bench
Users that are interested in asta-bench are comparing it to the libraries listed below
- Discovering Data-driven Hypotheses in the Wild ☆120 Updated 6 months ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery ☆112 Updated 3 months ago
- This repository contains ScholarQABench data and evaluation pipeline. ☆88 Updated 3 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆101 Updated last year
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award … ☆42 Updated last year
- ☆96 Updated this week
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore". ☆220 Updated last month
- Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty ☆90 Updated last year
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024] ☆70 Updated 10 months ago
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings. ☆173 Updated last week
- A benchmark that challenges language models to code solutions for scientific problems ☆157 Updated last week
- This is the official repository for HypoGeniC (Hypothesis Generation in Context) and HypoRefine, which are automated, data-driven tools t… ☆96 Updated last month
- ☆319 Updated last year
- ☆200 Updated last week
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding. ☆44 Updated 8 months ago
- Official repository for paper "ReasonIR: Training Retrievers for Reasoning Tasks". ☆209 Updated 5 months ago
- Papers about scientific hypothesis generation with large language models (LLMs). ☆76 Updated 6 months ago
- A virtual environment for developing and evaluating automated scientific discovery agents. ☆192 Updated 9 months ago
- A curated list of papers on LLMs and agents for scientific research and development ☆80 Updated last year
- Code and Data for "Language Modeling with Editable External Knowledge" ☆36 Updated last year
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales" ☆109 Updated last year
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples ☆112 Updated 4 months ago
- Source code for the collaborative reasoner research project at Meta FAIR. ☆111 Updated 7 months ago
- ☆52 Updated 8 months ago
- CiteME is a benchmark designed to test the abilities of language models in finding papers that are cited in scientific texts. ☆48 Updated last month
- ☆129 Updated last year
- ☆35 Updated 6 months ago
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning ☆108 Updated 2 weeks ago
- ☆36 Updated 6 months ago
- Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory ☆218 Updated 6 months ago