blackadad / paper-scraper
A proof of concept to scrape papers from journals
☆276, updated 9 months ago
Alternatives and similar repositories for paper-scraper:
Users who are interested in paper-scraper are comparing it to the libraries listed below.
- Tools to scrape publication metadata from pubmed, arxiv, medrxiv, biorxiv and chemrxiv. ☆325, updated this week
- A web scraping tool to systematically extract the text of scientific papers and corresponding metadata from university accessible journal… ☆196, updated 2 years ago
- Python PDF parser for scientific publications: content and figures ☆399, updated last year
- ☆167, updated last year
- LitQA Eval: A difficult set of scientific questions that require context of full-text research papers to answer ☆38, updated 3 months ago
- Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions. ☆220, updated 2 months ago
- Get answers to research questions from 200M+ papers. ☆206, updated last year
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON) ☆359, updated 11 months ago
- A toolkit for automatically extracting semantic information from PDF files of scientific articles ☆72, updated last year
- Gymnasium framework for training language model agents on constructive tasks ☆153, updated 2 weeks ago
- INSIGHT is an autonomous AI that can do medical research! ☆407, updated last year
- LitLLM: A Toolkit for Scientific Literature Review ☆55, updated last year
- Chemcrow ☆722, updated 3 months ago
- 📄 ⚙️ ETL processes for medical and scientific papers ☆378, updated 2 months ago
- A langchain agent that retries ☆48, updated last year
- Unofficial Python client library for Semantic Scholar APIs. ☆358, updated last month
- ⚡ Automating scientific workflows with AI ⚡ ☆385, updated 7 months ago
- 🤖🌊 aiFlows: The building blocks of your collaborative AI ☆254, updated 10 months ago
- ☆36, updated 5 months ago
- 🚀 gpt_pdf_md: Convert PDF to Markdown with GPT-4V & more. Extract images, upload to Google Cloud, & generate Markdown with images. Pytho… ☆81, updated last year
- This is a public repository to enable researchers to begin their journey of self-hosting data from Semantic Scholar. ☆41, updated 4 months ago
- ☆81, updated 11 months ago
- Fact-checking LLM outputs with self-ask ☆299, updated last year
- Python client for GROBID Web services ☆314, updated 3 weeks ago
- Code for the paper: "Large Language Models as Corporate Lobbyists" (2023). ☆171, updated 2 years ago
- Semantic search engine indexing 110 million academic publications ☆80, updated 2 weeks ago
- A virtual lab of LLM agents for science research ☆146, updated last month
- The GPT-4o Research Assistant is a tool designed to leverage the power of GPT-4o in assisting with academic research. It searches for aca… ☆107, updated 2 months ago
- ☆197, updated last year
- ☆18, updated 6 months ago