snap-stanford / POPPERLinks
Automated Hypothesis Testing with Agentic Sequential Falsifications
☆219Updated 3 months ago
Alternatives and similar repositories for POPPER
Users that are interested in POPPER are comparing it to the libraries listed below
Sorting:
- An aviary-based data science agent based on jupyter notebooks☆35Updated 2 months ago
- A language agent gym with challenging scientific tasks☆201Updated last week
- Framework enabling modular interchange of language agents, environments, and optimizers☆104Updated this week
- A virtual lab of LLM agents for science research☆429Updated 3 weeks ago
- CodeScientist: An automated scientific discovery system for code-based experiments☆289Updated 2 months ago
- Papers about scientific hypothesis generation with large language models (LLMs).☆73Updated 2 months ago
- ☆48Updated 4 months ago
- ☆33Updated last year
- Discovering Data-driven Hypotheses in the Wild☆104Updated 2 months ago
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆126Updated 11 months ago
- BioDiscoveryAgent is an LLM-based AI agent for closed-loop design of genetic perturbation experiments☆85Updated last month
- ☆79Updated 3 weeks ago
- ☆551Updated 3 months ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆98Updated 2 months ago
- BioMCP: Biomedical Model Context Protocol☆272Updated last week
- Robin: A multi-agent system for automating scientific discovery☆216Updated 2 months ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆229Updated 6 months ago
- ChatCell: Facilitating Single-Cell Analysis with Natural Language☆52Updated 2 months ago
- Benchmark for LLM-based Agents in Computational Biology☆50Updated 2 months ago
- LitQA Eval: A difficult set of scientific questions that require context of full-text research papers to answer☆41Updated 8 months ago
- SciToolAgent: A Knowledge Graph-Driven Scientific Agent for Multi-Tool Integration☆115Updated this week
- BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model☆276Updated 2 months ago
- ☆43Updated 10 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆118Updated 6 months ago
- CiteME is a benchmark designed to test the abilities of language models in finding papers that are cited in scientific texts.☆48Updated 9 months ago
- Pretraining infrastructure for multi-hybrid AI model architectures☆181Updated last month
- High accuracy RAG for answering questions from scientific documents with citations☆37Updated 11 months ago
- A virtual environment for developing and evaluating automated scientific discovery agents.☆180Updated 5 months ago
- Analysis code for paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆47Updated 3 weeks ago
- BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance.☆52Updated last year