Discovering Data-driven Hypotheses in the Wild
☆144Jun 9, 2025Updated 11 months ago
Alternatives and similar repositories for discoverybench
Users that are interested in discoverybench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆137Apr 29, 2026Updated 3 weeks ago
- [EMNLP 2024 Findings] Benchmarking Language Model Agents for Data-Driven Science☆35Oct 25, 2024Updated last year
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆44Oct 28, 2024Updated last year
- BioDiscoveryAgent is an LLM-based AI agent for closed-loop design of genetic perturbation experiments☆106Jul 6, 2025Updated 10 months ago
- Repository containing dataset, models and code associated with the CHIME project☆17Aug 22, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Dataset and annotations for ASSETS 2022 publication☆12Oct 6, 2022Updated 3 years ago
- EmbedGEM: A framework to evaluate the utility of embeddings for genetic discovery☆23Oct 3, 2024Updated last year
- ☆10Nov 6, 2024Updated last year
- Benchmark agents on BioML tasks☆68Sep 14, 2025Updated 8 months ago
- Automated Hypothesis Testing with Agentic Sequential Falsifications☆265May 14, 2025Updated last year
- A curated list of papers on LLMs and agents for scientific research and development☆90Dec 11, 2024Updated last year
- Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning", ICLR 2025☆34Apr 21, 2025Updated last year
- A standard library for biological research.☆37Sep 2, 2025Updated 8 months ago
- A benchmark that challenges language models to code solutions for scientific problems☆196Apr 27, 2026Updated 3 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Apr 4, 2024Updated 2 years ago
- Pytorch implementation of DeepNovoV2, a state-of-the-art de novo peptide sequencing model.☆27May 21, 2019Updated 7 years ago
- S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/☆1,056Apr 26, 2024Updated 2 years ago
- Benchmark for LLM-based Agents in Computational Biology☆108Oct 6, 2025Updated 7 months ago
- Headway - Selenium Maven TestNG POM Data Driven Framework☆18Jul 2, 2025Updated 10 months ago
- Reasoning by Communicating with Agents☆29Apr 29, 2025Updated last year
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.☆359Dec 3, 2025Updated 5 months ago
- ☆64Apr 25, 2020Updated 6 years ago
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆29Jul 23, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆24May 31, 2024Updated last year
- Code/data for MARG (multi-agent review generation)☆62Mar 5, 2026Updated 2 months ago
- ☆12Feb 11, 2026Updated 3 months ago
- ☆20Jan 18, 2022Updated 4 years ago
- SciQAG is a novel framework for automatically generating high-quality science question-answer pairs from a large corpus of scientific lit…☆34Mar 24, 2025Updated last year
- ☆10Jun 1, 2024Updated last year
- Simple and scalable tools for data-driven pretraining data selection.☆29Jun 9, 2025Updated 11 months ago
- ☆33Feb 11, 2025Updated last year
- code for the NAACL 2021 paper Compositional Generalization for Neural Semantic Parsing via Span-level Supervised Attention by Microsoft S…☆12Apr 21, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆38Oct 24, 2024Updated last year
- Code for Estimating Multi-cause Treatment Effects via Single-cause Perturbation (NeurIPS 2021)☆14Jan 5, 2022Updated 4 years ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆34Oct 8, 2025Updated 7 months ago
- Example workflow for our data-centric speech benchmark☆17Jul 6, 2023Updated 2 years ago
- ☆26Updated this week
- ☆285May 18, 2026Updated last week
- ☆29Mar 22, 2024Updated 2 years ago