snap-stanford / POPPERLinks
Automated Hypothesis Testing with Agentic Sequential Falsifications
☆234Updated 6 months ago
Alternatives and similar repositories for POPPER
Users that are interested in POPPER are comparing it to the libraries listed below
Sorting:
- A language agent gym with challenging scientific tasks☆219Updated last month
- Framework enabling modular interchange of language agents, environments, and optimizers☆116Updated this week
- An aviary-based data science agent based on jupyter notebooks☆42Updated 2 months ago
- ☆58Updated 3 months ago
- BioDiscoveryAgent is an LLM-based AI agent for closed-loop design of genetic perturbation experiments☆91Updated 5 months ago
- Robin: A multi-agent system for automating scientific discovery☆266Updated 2 weeks ago
- Papers about scientific hypothesis generation with large language models (LLMs).☆76Updated 6 months ago
- CodeScientist: An automated scientific discovery system for code-based experiments☆303Updated last week
- A virtual lab of LLM agents for science research☆567Updated 4 months ago
- ☆97Updated this week
- Evaluation dataset for AI systems intended to benchmark capabilities foundational to scientific research in biology☆93Updated 2 months ago
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆131Updated last year
- ☆575Updated 7 months ago
- Benchmark for LLM-based Agents in Computational Biology☆65Updated 2 months ago
- SciToolAgent: A Knowledge Graph-Driven Scientific Agent for Multi-Tool Integration☆239Updated 3 months ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆235Updated 9 months ago
- ☆38Updated last year
- BioMCP: Biomedical Model Context Protocol☆371Updated 2 weeks ago
- Democratizing AI scientists with ToolUniverse☆721Updated this week
- Data from BioPlanner: Automatic Evaluation of LLMs on Protocol Planning in Biology paper☆26Updated last year
- ChatCell: Facilitating Single-Cell Analysis with Natural Language☆52Updated 6 months ago
- Discovering Data-driven Hypotheses in the Wild☆120Updated 6 months ago
- Official implementation for the paper "Toward Scientific Reasoning in LLMs: Training from Expert Discussions via Reinforcement Learning"☆51Updated 6 months ago
- ☆34Updated last year
- Official code for NeurIPS 2025 paper "AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise"☆107Updated last week
- BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model | NeurIPS '25☆326Updated 2 weeks ago
- ☆278Updated last year
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆207Updated 3 months ago
- LitQA Eval: A difficult set of scientific questions that require context of full-text research papers to answer☆43Updated 11 months ago
- ☆79Updated 2 months ago