snap-stanford / POPPERLinks
Automated Hypothesis Testing with Agentic Sequential Falsifications
☆216Updated 2 months ago
Alternatives and similar repositories for POPPER
Users that are interested in POPPER are comparing it to the libraries listed below
Sorting:
- A language agent gym with challenging scientific tasks☆196Updated this week
- Framework enabling modular interchange of language agents, environments, and optimizers☆101Updated 2 weeks ago
- An aviary-based data science agent based on jupyter notebooks☆34Updated last month
- ☆46Updated 3 months ago
- CodeScientist: An automated scientific discovery system for code-based experiments☆288Updated last month
- BioMCP: Biomedical Model Context Protocol☆233Updated this week
- A virtual lab of LLM agents for science research☆334Updated this week
- BioDiscoveryAgent is an LLM-based AI agent for closed-loop design of genetic perturbation experiments☆80Updated last month
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆125Updated 10 months ago
- Papers about scientific hypothesis generation with large language models (LLMs).☆72Updated 2 months ago
- Robin: A multi-agent system for automating scientific discovery☆205Updated last month
- ☆33Updated last year
- ToolUniverse is a collection of biomedical tools designed for AI agents☆190Updated 3 weeks ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆226Updated 5 months ago
- ☆73Updated 5 months ago
- ChatCell: Facilitating Single-Cell Analysis with Natural Language☆52Updated 2 months ago
- Discovering Data-driven Hypotheses in the Wild☆104Updated 2 months ago
- Evaluation dataset for AI systems intended to benchmark capabilities foundational to scientific research in biology☆79Updated 3 weeks ago
- ☆548Updated 2 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆118Updated 5 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆72Updated 8 months ago
- Analysis code for paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆45Updated this week
- Pretraining infrastructure for multi-hybrid AI model architectures☆177Updated 3 weeks ago
- ☆70Updated last week
- Benchmark for LLM-based Agents in Computational Biology☆47Updated last month
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆94Updated 2 months ago
- ☆146Updated 11 months ago
- BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model☆267Updated last month
- MIRIAD is a million scale Medical Instruction and RetrIeval Datatset☆107Updated last month
- A virtual environment for developing and evaluating automated scientific discovery agents.☆168Updated 4 months ago