NLPatVCU / PaperScraper
A web scraping tool to systematically extract the text of scientific papers and corresponding metadata from university accessible journals.
☆197Updated 2 years ago
Alternatives and similar repositories for PaperScraper:
Users that are interested in PaperScraper are comparing it to the libraries listed below
- Uses publisher APIs to programmatically retrieve scientific journal articles for text mining.☆123Updated last year
- Tools to scrape publications & their metadata from pubmed, arxiv, medrxiv, biorxiv and chemrxiv.☆336Updated last week
- A toolkit for automatically extracting semantic information from PDF files of scientific articles☆72Updated last year
- A python library that implements the Crossref API.☆310Updated 6 months ago
- client for Crossref search API☆219Updated 3 weeks ago
- A Python library for OpenAlex (openalex.org)☆226Updated last week
- Python toolkit for NCBI metadata (via eutils) and pubmed article text mining -- official primary repo.☆113Updated 2 months ago
- Extract data from all Google Scholar pages from a single Python module. NOTE: I'm no longer maintaining this repo. Chrome driver/selector…☆104Updated last year
- PyPaperBot is a Python tool for downloading scientific papers using Google Scholar, Crossref, SciHub, and SciDB.☆493Updated 4 months ago
- A collection of Jupyter notebooks, each walking you through a common example of bibliometric analysis using scholarly data from the OpenA…☆113Updated 11 months ago
- litreviewer is a Python package (collection of few Python modules) that helps researchers perform crawling, scraping, collecting (corpus)…☆41Updated 9 months ago
- Python-based API-Wrapper to access Scopus☆442Updated this week
- An unofficial api for downloading papers from SciHub via DOI, PMID, title☆247Updated last year
- a Python version of getpapers☆84Updated 10 months ago
- This web app aims to help scientists with their literature review using metadata from OpenAlex (OA), Semantic Scholar (S2) and Crossref (…☆120Updated 3 weeks ago
- Simple python parser for MEDLINE, Pubmed OA affiliation string☆37Updated 3 years ago
- A Python module for use with Elsevier's APIs: Scopus, ScienceDirect, others.☆392Updated 2 years ago
- Extract a citation network from Google Scholar☆163Updated 6 months ago
- PyMed is a Python library that provides access to PubMed.☆208Updated 3 years ago
- A Python package to download full article PDFs from OA publications☆41Updated 3 months ago
- Open Access PDF harvester, metadata aggregator and full-text ingester☆60Updated 11 months ago
- A curated collection of resources on scholarly data analysis ranging from datasets, papers, and code about bibliometrics, citation analys…☆185Updated 2 months ago
- Public release of data and code for materials synthesis generation☆73Updated 2 years ago
- Automatically extract chemical information from scientific documents☆322Updated last year
- Science of Science☆174Updated 3 weeks ago
- Fetches PubMed article IDs (PMIDs) from email inbox, then crawls PubMed, Google Scholar and Sci-Hub for respective PDF files.☆34Updated 6 years ago
- Automatic synthesis of RCTs☆149Updated 2 years ago
- LitStudy: Using the power of Python to automate scientific literature analysis from the comfort of a Jupyter notebook☆184Updated 8 months ago
- Scientific literature explorer. Runs a Pubmed or Semantic Scholar search and allows user to explore high-level structure of result papers☆41Updated this week
- https://doi.org/10.1093/bioinformatics/btz228☆39Updated 4 months ago