snap-stanford / POPPERLinks

Automated Hypothesis Testing with Agentic Sequential Falsifications

☆231

Alternatives and similar repositories for POPPER

Users that are interested in POPPER are comparing it to the libraries listed below

Sorting:

Future-House / aviary
A language agent gym with challenging scientific tasks
☆210Updated last week
Future-House / ldp
Framework enabling modular interchange of language agents, environments, and optimizers
☆114Updated this week
Future-House / data-analysis-crow
An aviary-based data science agent based on jupyter notebooks
☆41Updated last month
snap-stanford / BioDiscoveryAgent
BioDiscoveryAgent is an LLM-based AI agent for closed-loop design of genetic perturbation experiments
☆90Updated 4 months ago
Paureel / LLM-SCI-GEN
Papers about scientific hypothesis generation with large language models (LLMs).
☆76Updated 5 months ago
kidzik / other-public-mcps
☆57Updated 2 months ago
allenai / codescientist
CodeScientist: An automated scientific discovery system for code-based experiments
☆300Updated 4 months ago
Future-House / robin
Robin: A multi-agent system for automating scientific discovery
☆250Updated last week
lamm-mit / PRefLexOR
Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning
☆233Updated 8 months ago
zou-group / virtual-lab
A virtual lab of LLM agents for science research
☆544Updated 3 months ago
allenai / discoverybench
Discovering Data-driven Hypotheses in the Wild
☆118Updated 5 months ago
lamm-mit / SciAgentsDiscovery
☆567Updated 6 months ago
mims-harvard / ToolUniverse
Democratizing AI scientists with ToolUniverse
☆680Updated this week
Future-House / BixBench
Benchmark for LLM-based Agents in Computational Biology
☆62Updated last month
allenai / discoveryworld
A virtual environment for developing and evaluating automated scientific discovery agents.
☆190Updated 8 months ago
allenai / autods
☆88Updated this week
genomoncology / biomcp
BioMCP: Biomedical Model Context Protocol
☆353Updated last week
yale-nlp / SciArena
Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"
☆55Updated 3 months ago
Future-House / LAB-Bench
Evaluation dataset for AI systems intended to benchmark capabilities foundational to scientific research in biology
☆91Updated last month
allenai / agent-baselines
☆86Updated 2 weeks ago
google-research / score
Code associated with the paper An AI system to help scientists write expert-level empirical software
☆96Updated 2 months ago
kyunghyuncho / pubmed-vectors
☆33Updated last year
HICAI-ZJU / SciToolAgent
SciToolAgent: A Knowledge Graph-Driven Scientific Agent for Multi-Tool Integration
☆221Updated 2 months ago
openlifescience-ai / Open-Medical-Reasoning-Tasks
A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)
☆130Updated last year
Future-House / WikiCrow
☆38Updated last year
ChicagoHAI / hypothesis-generation
This is the official repository for HypoGeniC (Hypothesis Generation in Context) and HypoRefine, which are automated, data-driven tools t…
☆91Updated last week
bowang-lab / BioReason
BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model | NeurIPS '25
☆323Updated last week
mingyin0312 / RL4GenomeBench
Official implementation for the paper "Toward Scientific Reasoning in LLMs: Training from Expert Discussions via Reinforcement Learning"
☆52Updated 5 months ago
zjunlp / ChatCell
ChatCell: Facilitating Single-Cell Analysis with Natural Language
☆52Updated 5 months ago
OSU-NLP-Group / ScienceAgentBench
[ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
☆109Updated 2 months ago