jannisborn / paperscraper
Tools to scrape publication metadata from pubmed, arxiv, medrxiv, biorxiv and chemrxiv.
☆316Updated 2 weeks ago
Alternatives and similar repositories for paperscraper:
Users that are interested in paperscraper are comparing it to the libraries listed below
- ☆166Updated last year
- Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions.☆220Updated last month
- ChemNLP project☆159Updated this week
- Python PDF parser for scientific publications: content and figures☆396Updated 11 months ago
- LitQA Eval: A difficult set of scientific questions that require context of full-text research papers to answer☆38Updated 2 months ago
- A web scraping tool to systematically extract the text of scientific papers and corresponding metadata from university accessible journal…☆195Updated 2 years ago
- Python toolkit for NCBI metadata (via eutils) and pubmed article text mining -- official primary repo.☆108Updated last month
- ☆81Updated 11 months ago
- A virtual lab of LLM agents for science research☆144Updated 3 weeks ago
- Get answers to research questions from 200M+ papers. Link to demo -☆206Updated last year
- Papers about scientific hypothesis generation with large language models (LLMs).☆57Updated 2 weeks ago
- [ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language Models☆269Updated 4 months ago
- The landscape of biomedical research☆114Updated 10 months ago
- Gymnasium framework for training language model agents on constructive tasks☆151Updated this week
- Code and data for the publication "Structured information extraction from scientific text with large language models" by Dagdelen & Dunn …☆92Updated last year
- Incorporating distribution of experts in order to better predict the future discovery of novel scientific connections☆29Updated last year
- ☆220Updated last month
- Ankh: Optimized Protein Language Model☆220Updated last year
- A toolkit for automatically extracting semantic information from PDF files of scientific articles☆71Updated last year
- A Python library for OpenAlex (openalex.org)☆210Updated this week
- Chemcrow☆710Updated 2 months ago
- SciRepEval benchmark training and evaluation scripts☆72Updated 9 months ago
- Multi-modal Molecule Structure-text Model for Text-based Editing and Retrieval, Nat Mach Intell 2023 (https://www.nature.com/articles/s42…☆220Updated 2 months ago
- BioT5 (EMNLP 2023) and BioT5+ (ACL 2024 Findings)☆106Updated 5 months ago
- bert-loves-chemistry: a repository of HuggingFace models applied on chemical SMILES data for drug design, chemical modelling, etc.☆437Updated 4 months ago
- A very fast visualization library for large, high-dimensional data sets.☆220Updated 3 months ago
- Python client for GROBID Web services☆314Updated last week
- Molecular dynamics simulations with an LLM agent☆174Updated this week
- BERN2: an advanced neural biomedical namedentity recognition and normalization tool☆182Updated 11 months ago
- ☆36Updated 4 months ago