snap-stanford / POPPER
Automated Hypothesis Testing with Agentic Sequential Falsifications
☆242 Updated 8 months ago
Alternatives and similar repositories for POPPER
Users who are interested in POPPER are comparing it to the libraries listed below.
- A language agent gym with challenging scientific tasks ☆232 Updated this week
- Framework enabling modular interchange of language agents, environments, and optimizers ☆119 Updated this week
- An aviary-based data science agent based on Jupyter notebooks ☆43 Updated 3 months ago
- CodeScientist: An automated scientific discovery system for code-based experiments ☆306 Updated last month
- BioDiscoveryAgent is an LLM-based AI agent for closed-loop design of genetic perturbation experiments ☆95 Updated 6 months ago
- ☆58 Updated 4 months ago
- Papers about scientific hypothesis generation with large language models (LLMs). ☆79 Updated 7 months ago
- Robin: A multi-agent system for automating scientific discovery ☆275 Updated last month
- Official code for NeurIPS 2025 paper "AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise" ☆121 Updated last week
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning ☆235 Updated 10 months ago
- Discovering Data-driven Hypotheses in the Wild ☆127 Updated 7 months ago
- ☆578 Updated 8 months ago
- ☆103 Updated this week
- A virtual lab of LLM agents for science research ☆604 Updated 3 weeks ago
- ☆34 Updated last year
- Benchmark for LLM-based Agents in Computational Biology ☆68 Updated 3 months ago
- Evaluation dataset for AI systems intended to benchmark capabilities foundational to scientific research in biology ☆96 Updated 3 months ago
- CRISPR-GPT LLM Agent Public Welcome Page ☆135 Updated 5 months ago
- ChatCell: Facilitating Single-Cell Analysis with Natural Language ☆52 Updated 7 months ago
- Kosmos: An AI Scientist for Autonomous Discovery - An implementation and adaptation to be driven by Claude Code or API - Based on the Kos… ☆393 Updated last month
- ☆93 Updated 2 weeks ago
- Analysis code for NeurIPS 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks" ☆55 Updated 5 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT) ☆128 Updated 11 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results ☆220 Updated 5 months ago
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond) ☆132 Updated last year
- BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model | NeurIPS '25 ☆355 Updated 3 weeks ago
- A virtual environment for developing and evaluating automated scientific discovery agents. ☆199 Updated 10 months ago
- BioMCP: Biomedical Model Context Protocol ☆395 Updated last week
- ☆80 Updated 3 months ago
- ☆107 Updated last month