NLPatVCU / PaperScraper
A web scraping tool to systematically extract the text of scientific papers and corresponding metadata from university-accessible journals.
☆212 · Updated 2 years ago
Alternatives and similar repositories for PaperScraper
Users who are interested in PaperScraper are comparing it to the libraries listed below.
- A python library that implements the Crossref API. ☆323 · Updated 3 months ago
- Tools to scrape publications & their metadata from pubmed, arxiv, medrxiv, biorxiv and chemrxiv. ☆419 · Updated last month
- PyPaperBot is a Python tool for downloading scientific papers using Google Scholar, Crossref, SciHub, and SciDB. ☆551 · Updated 9 months ago
- Uses publisher APIs to programmatically retrieve scientific journal articles for text mining. ☆134 · Updated last year
- A proof of concept to scrape papers from journals ☆285 · Updated last year
- A toolkit for automatically extracting semantic information from PDF files of scientific articles ☆72 · Updated last year
- An unofficial api for downloading papers from SciHub via DOI, PMID, title ☆282 · Updated last year
- Python-based API-Wrapper to access Scopus ☆468 · Updated last month
- Search for and retrieve US Patent and Trademark Office Patent Data ☆82 · Updated 5 years ago
- Open Access PDF harvester, metadata aggregator and full-text ingester ☆63 · Updated last year
- A Python module for use with Elsevier's APIs: Scopus, ScienceDirect, others. ☆412 · Updated 2 years ago
- client for Crossref search API ☆232 · Updated 2 months ago
- PyMed is a Python library that provides access to PubMed. ☆212 · Updated 3 years ago
- Extract data from all Google Scholar pages from a single Python module. ☆116 · Updated 2 months ago
- a Python version of getpapers ☆87 · Updated 2 months ago
- ☆20 · Updated 6 months ago
- A Python library for OpenAlex (openalex.org) ☆290 · Updated 3 months ago
- LitStudy: Using the power of Python to automate scientific literature analysis from the comfort of a Jupyter notebook ☆198 · Updated 4 months ago
- Python toolkit for NCBI metadata (via eutils) and pubmed article text mining -- official primary repo. ☆128 · Updated last month
- A Python package to download full article PDFs from OA publications ☆49 · Updated 8 months ago
- litreviewer is a Python package (collection of few Python modules) that helps researchers perform crawling, scraping, collecting (corpus)… ☆45 · Updated last year
- Python client for GROBID Web services ☆364 · Updated this week
- Automatic synthesis of RCTs ☆162 · Updated 3 years ago
- Automatically extract chemical information from scientific documents ☆334 · Updated 2 years ago
- Scientific literature explorer. Runs a Pubmed or Semantic Scholar search and allows user to explore high-level structure of result papers ☆48 · Updated last month
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON) ☆439 · Updated last year
- A high performance bibliographic information service: https://biblio-glutton.readthedocs.io ☆144 · Updated 3 months ago
- ☆30 · Updated 4 years ago
- Simple python parser for MEDLINE, Pubmed OA affiliation string ☆38 · Updated 4 years ago
- S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/ ☆974 · Updated last year