NLPatVCU / PaperScraper
A web scraping tool to systematically extract the text of scientific papers and corresponding metadata from university accessible journals.
☆191Updated 2 years ago
Alternatives and similar repositories for PaperScraper:
Users that are interested in PaperScraper are comparing it to the libraries listed below
- Tools to scrape publication metadata from pubmed, arxiv, medrxiv and chemrxiv.☆285Updated this week
- Uses publisher APIs to programmatically retrieve scientific journal articles for text mining.☆119Updated last year
- Python client for GROBID Web services☆301Updated 2 weeks ago
- PyPaperBot is a Python tool for downloading scientific papers using Google Scholar, Crossref, SciHub, and SciDB.☆442Updated last month
- Python toolkit for NCBI metadata (via eutils) and pubmed article text mining -- official primary repo.☆101Updated 5 months ago
- A toolkit for automatically extracting semantic information from PDF files of scientific articles☆68Updated last year
- A Python library for OpenAlex (openalex.org)☆186Updated this week
- Extract data from all Google Scholar pages from a single Python module. NOTE: I'm no longer maintaining this repo. Chrome driver/selector…☆95Updated last year
- client for Crossref search API☆212Updated last month
- A python library that implements the Crossref API.☆296Updated 3 months ago
- A Python module for use with Elsevier's APIs: Scopus, ScienceDirect, others.☆379Updated 2 years ago
- Python-based API-Wrapper to access Scopus☆430Updated this week
- A collection of Jupyter notebooks, each walking you through a common example of bibliometric analysis using scholarly data from the OpenA…☆100Updated 8 months ago
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)☆351Updated 9 months ago
- PyMed is a Python library that provides access to PubMed.☆201Updated 3 years ago
- Open Access PDF harvester, metadata aggregator and full-text ingester☆57Updated 8 months ago
- This web app aims to help scientists with their literature review using metadata from OpenAlex (OA), Semantic Scholar (S2) and Crossref (…☆114Updated 2 months ago
- A curated collection of resources on scholarly data analysis ranging from datasets, papers, and code about bibliometrics, citation analys…☆179Updated last year
- A Python library for doing bibliometric and network analysis in science and health policy research☆166Updated 2 years ago
- ☆27Updated 3 years ago
- Automatic synthesis of RCTs☆145Updated 2 years ago
- Service for converting and enhancing heterogeneous publisher XML formats into TEI☆47Updated 4 months ago
- a Python version of getpapers☆81Updated 7 months ago
- Python library for the OpenAlex HTTP API☆23Updated last year
- litreviewer is a Python package (collection of few Python modules) that helps researchers perform crawling, scraping, collecting (corpus)…☆38Updated 6 months ago
- A Python Parser for PubMed Open-Access XML Subset and MEDLINE XML Dataset☆605Updated 3 weeks ago
- Companion code to the paper "Extracting Scientific Figures with Distantly Supervised Neural Networks" 🤖☆139Updated 2 years ago
- LitStudy: Using the power of Python to automate scientific literature analysis from the comfort of a Jupyter notebook☆176Updated 5 months ago
- Scripts used to make and evaluate OpenAlex's concept tagging model☆48Updated last year
- OpenAlex Networks is a helper library to process and obtain data from the OpenAlex dataset via API. It also provides functionality to gen…☆19Updated last year