Future-House / LitQALinks
LitQA Eval: A difficult set of scientific questions that require context of full-text research papers to answer
☆41Updated 9 months ago
Alternatives and similar repositories for LitQA
Users that are interested in LitQA are comparing it to the libraries listed below
Sorting:
- Evaluation dataset for AI systems intended to benchmark capabilities foundational to scientific research in biology☆83Updated 2 months ago
- Data from BioPlanner: Automatic Evaluation of LLMs on Protocol Planning in Biology paper☆26Updated last year
- Benchmark for LLM-based Agents in Computational Biology☆51Updated 3 months ago
- A language agent gym with challenging scientific tasks☆202Updated this week
- Framework enabling modular interchange of language agents, environments, and optimizers☆106Updated this week
- ☆38Updated 10 months ago
- An aviary-based data science agent based on jupyter notebooks☆35Updated 3 months ago
- ChemNLP project☆164Updated this week
- Tools to scrape publications & their metadata from pubmed, arxiv, medrxiv, biorxiv and chemrxiv.☆412Updated last month
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆117Updated last year
- Papers about scientific hypothesis generation with large language models (LLMs).☆74Updated 3 months ago
- ☆178Updated last month
- ☆77Updated 2 weeks ago
- Discovering Data-driven Hypotheses in the Wild☆110Updated 3 months ago
- Code for CTO: A Large Clinical Trial Outcome and QA Dataset☆26Updated last month
- ☆18Updated 6 months ago
- Official code repo for the paper "LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality …☆97Updated 3 months ago
- A scientific reasoning model, dataset, and reward functions for chemistry.☆130Updated 2 months ago
- [ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models☆284Updated 10 months ago
- reasoning model trained using GRPO towards rosetta REF2015 for protein stability☆91Updated last month
- BioDiscoveryAgent is an LLM-based AI agent for closed-loop design of genetic perturbation experiments☆87Updated 2 months ago
- BioCoder: A Benchmark for Bioinformatics Code Generation with Large Language Models https://arxiv.org/abs/2308.16458☆51Updated last month
- ☆30Updated 2 years ago
- [ICML 25] We train and evaluate SAEs to identify interpretable features in pLMs and show their potential for scientific discovery.☆117Updated 4 months ago
- A proof of concept to scrape papers from journals☆288Updated last year
- ☆50Updated 11 months ago
- ☆56Updated last week
- Automated Hypothesis Testing with Agentic Sequential Falsifications☆227Updated 4 months ago
- A virtual lab of LLM agents for science research☆468Updated last month
- [ICLR 2025] <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses>☆48Updated 2 weeks ago