ourresearch / openalex-pdf-parser
PDF parser powered by grobid
☆25Updated 7 months ago
Alternatives and similar repositories for openalex-pdf-parser:
Users that are interested in openalex-pdf-parser are comparing it to the libraries listed below
- link raw affiliation to ROR ids☆27Updated last year
- ☆54Updated last year
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆14Updated 6 months ago
- Open Access PDF harvester, metadata aggregator and full-text ingester☆59Updated 10 months ago
- SciRepEval benchmark training and evaluation scripts☆72Updated 9 months ago
- One downloader for many scientific data and code repositories! DOI Data☆70Updated last week
- Finding mentions and citations to named and implicit research datasets from within the academic literature☆24Updated 4 months ago
- Service for converting and enhancing heterogeneous publisher XML formats into TEI☆53Updated 5 months ago
- An easy way to chunk spaCy docs.☆19Updated 6 months ago
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents☆23Updated 2 years ago
- Compute novelty indicators☆31Updated 8 months ago
- Python library for the OpenAlex HTTP API☆23Updated 2 years ago
- Scripts used to make and evaluate OpenAlex's concept tagging model☆48Updated last year
- A BERT-based application for reusable text classification at scale☆38Updated last year
- Ricgraph - Research in context graph☆27Updated this week
- Python API Wrapper for OpenAlex. Query OpenAlex for metadata in Python.☆19Updated 2 years ago
- Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite☆90Updated last year
- OpenAlex Networks is a helper library to process and obtain data from the OpenAlex dataset via API. It also provides functionality to gen…☆19Updated last year
- Viewer for the structure extracted by Grobid on PDF documents☆46Updated 3 weeks ago
- Keeping It Simple is Hard☆10Updated last year
- A python library for the Semantic Scholar (S2) API with typed pydantic objects and various nifty functionalities.☆19Updated 3 years ago
- ☆46Updated last week
- Scrollership through 20m pubmed abstracts.☆26Updated last year
- Downloader, preprocessor, parser and deduper for NIH and NSF grants☆20Updated 6 years ago
- Knowledge Graph Generator app☆30Updated 10 months ago
- A deep learning model for extracting references from text☆27Updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated 10 months ago
- A basic tool that extracts the structure from the PDF files of scientific articles.☆74Updated 3 years ago
- Jupyter notebooks with examples of querying different PID graphs and providers like OpenAlex, FREYA PID Graph, OpenAIRE, ORCID, ROR, Cros…☆23Updated 2 years ago
- A spaCy wrapper for GliNER☆108Updated last month