ourresearch / openalex-pdf-parser
PDF parser powered by grobid
☆28 Updated last year
Alternatives and similar repositories for openalex-pdf-parser
Users who are interested in openalex-pdf-parser are comparing it to the libraries listed below
- Open Access PDF harvester, metadata aggregator and full-text ingester ☆63 Updated last year
- link raw affiliation to ROR ids ☆30 Updated 2 years ago
- A high performance bibliographic information service: https://biblio-glutton.readthedocs.io ☆144 Updated 3 months ago
- The guts for computing data for OpenAlex. For more, see https://openalex.org/. ☆143 Updated 3 weeks ago
- Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite ☆95 Updated last week
- A basic tool that extracts the structure from the PDF files of scientific articles. ☆75 Updated 3 years ago
- Service for converting and enhancing heterogeneous publisher XML formats into TEI ☆57 Updated last year
- ☆55 Updated last year
- A collection of Jupyter notebooks, each walking you through a common example of bibliometric analysis using scholarly data from the OpenA… ☆121 Updated last year
- Viewer for the structure extracted by Grobid on PDF documents ☆54 Updated this week
- A Python library for OpenAlex (openalex.org) ☆289 Updated 2 months ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities. ☆18 Updated last year
- Python client for GROBID Web services ☆358 Updated this week
- Scripts used to make and evaluate OpenAlex's concept tagging model ☆49 Updated 2 years ago
- Finding mentions and citations to named and implicit research datasets from within the academic literature ☆29 Updated 3 months ago
- SciRepEval benchmark training and evaluation scripts ☆76 Updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax. ☆59 Updated last year
- ☆67 Updated last year
- A curated collection of resources on scholarly data analysis ranging from datasets, papers, and code about bibliometrics, citation analys… ☆192 Updated last month
- Scientific Document Insight Q/A ☆30 Updated 2 weeks ago
- ☆40 Updated last month
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆164 Updated 2 years ago
- A spaCy wrapper for GliNER ☆118 Updated 7 months ago
- Robust and fast topic models with sentence-transformers. ☆80 Updated this week
- ☆102 Updated last year
- GROBID extension for identifying and normalizing physical quantities. ☆82 Updated 3 months ago
- Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-l… ☆125 Updated 3 months ago
- Tools for interactive visual exploration of semantic embeddings. ☆38 Updated last year
- Simple python parser for MEDLINE, Pubmed OA affiliation string ☆38 Updated 4 years ago
- A Fast, Adaptive, Stable, and Transferable Topic Model (NeurIPS 2024) ☆122 Updated last month