dimitryzub / scrape-google-scholar-py
Extract data from all Google Scholar pages from a single Python module. NOTE: I'm no longer maintaining this repo. Chrome driver/selectors might need and update.
☆101Updated last year
Alternatives and similar repositories for scrape-google-scholar-py:
Users that are interested in scrape-google-scholar-py are comparing it to the libraries listed below
- A collection of Jupyter notebooks, each walking you through a common example of bibliometric analysis using scholarly data from the OpenA…☆106Updated 9 months ago
- A web scraping tool to systematically extract the text of scientific papers and corresponding metadata from university accessible journal…☆193Updated 2 years ago
- Open Access PDF harvester, metadata aggregator and full-text ingester☆59Updated 9 months ago
- litreviewer is a Python package (collection of few Python modules) that helps researchers perform crawling, scraping, collecting (corpus)…☆40Updated 7 months ago
- client for Crossref search API☆215Updated 2 weeks ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆47Updated 6 months ago
- A toolkit for automatically extracting semantic information from PDF files of scientific articles☆70Updated last year
- Search for and retrieve US Patent and Trademark Office Patent Data☆79Updated 4 years ago
- A curated collection of resources on scholarly data analysis ranging from datasets, papers, and code about bibliometrics, citation analys…☆181Updated last week
- PyScopus☆23Updated last year
- A python library that implements the Crossref API.☆304Updated 4 months ago
- ☆103Updated 9 months ago
- Get answers to research questions from 200M+ papers. Link to demo -☆205Updated last year
- A Python library for OpenAlex (openalex.org)☆198Updated last week
- Scientific literature explorer. Runs a Pubmed or Semantic Scholar search and allows user to explore high-level structure of result papers☆40Updated last week
- A python library/command-line tool to extract the DOI or other identifiers of a scientific paper from a pdf file.☆114Updated 3 months ago
- OpenAlex Networks is a helper library to process and obtain data from the OpenAlex dataset via API. It also provides functionality to gen…☆19Updated last year
- ☆27Updated 3 years ago
- Compute novelty indicators☆31Updated 8 months ago
- Python API Wrapper for OpenAlex. Query OpenAlex for metadata in Python.☆19Updated 2 years ago
- All the OpenAlex API endpoints that are backed by Elasticsearch☆21Updated this week
- PyPaperBot is a Python tool for downloading scientific papers using Google Scholar, Crossref, SciHub, and SciDB.☆467Updated 2 months ago
- This web app aims to help scientists with their literature review using metadata from OpenAlex (OA), Semantic Scholar (S2) and Crossref (…☆116Updated 2 weeks ago
- Streaming responses with Streamlit, ChatGPT and Langchain.☆11Updated last year
- Pip-installable Python package to automate handsearching and citation searching for systematic reviews.☆12Updated 7 months ago
- Automatically download all PDF files of searching results & their patent families found on Google Patents.☆61Updated 2 years ago
- The WIPO Patent Analytics Handbook (Work in Progress)☆25Updated 2 years ago
- An unofficial api for downloading papers from SciHub via DOI, PMID, title☆226Updated last year
- A proof of concept to scrape papers from journals☆272Updated 8 months ago
- The GitBook documentation site for OpenAlex☆18Updated last month