snap-stanford / POPPER
Automated Hypothesis Testing with Agentic Sequential Falsifications
☆160Updated last month
Alternatives and similar repositories for POPPER:
Users that are interested in POPPER are comparing it to the libraries listed below
- Gymnasium framework for training language model agents on constructive tasks☆155Updated 3 weeks ago
- Agent framework for constructing language model agents and training on constructive tasks.☆68Updated last week
- A virtual lab of LLM agents for science research☆148Updated last month
- BioDiscoveryAgent is an LLM-based AI agent for closed-loop design of genetic perturbation experiments☆53Updated 4 months ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆191Updated last month
- Papers about scientific hypothesis generation with large language models (LLMs).☆59Updated last month
- ☆30Updated 9 months ago
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆116Updated 6 months ago
- ☆37Updated 5 months ago
- LitQA Eval: A difficult set of scientific questions that require context of full-text research papers to answer☆38Updated 3 months ago
- An aviary-based data science agent based on jupyter notebooks☆12Updated 2 weeks ago
- ToolUniverse is a collection of biomedical tools designed for AI agents☆81Updated last week
- ☆66Updated 5 months ago
- Evaluation dataset for AI systems intended to benchmark capabilities foundational to scientific research in biology☆41Updated last month
- ☆61Updated last month
- 🧠🔗 Graph-Based Programmable Neuro-Symbolic LM Framework - a production-first LM framework built with decade old Deep Learning best prac…☆141Updated last week
- ☆500Updated last month
- A user interface for DSPy☆140Updated 5 months ago
- ☆99Updated last month
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆76Updated 6 months ago
- ChatCell: Facilitating Single-Cell Analysis with Natural Language☆48Updated last year
- Automating enterprise workflows with multimodal agents☆102Updated 5 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆63Updated 3 months ago
- MedEmbed is a collection of embedding models fine-tuned specifically for medical and clinical data.☆45Updated 5 months ago
- reasoning model trained using GRPO towards rosetta REF2015 for protein stability☆60Updated last week
- Train your own SOTA deductive reasoning model☆81Updated 3 weeks ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 2 months ago
- ☆88Updated 2 weeks ago
- Attribute (or cite) statements generated by LLMs back to in-context information.☆221Updated 5 months ago
- Code for MedCPT, a model for zero-shot biomedical information retrieval.☆169Updated last year