ourresearch / openalex-pdf-parser
PDF parser powered by grobid
☆25 · Updated 8 months ago
Alternatives and similar repositories for openalex-pdf-parser:
Users who are interested in openalex-pdf-parser are comparing it to the libraries listed below.
- Finding mentions and citations to named and implicit research datasets from within the academic literature ☆24 · Updated 6 months ago
- Open Access PDF harvester, metadata aggregator and full-text ingester ☆61 · Updated 11 months ago
- Python library for the OpenAlex HTTP API ☆23 · Updated 2 years ago
- Compute novelty indicators ☆32 · Updated 9 months ago
- ☆54 · Updated last year
- link raw affiliation to ROR ids ☆29 · Updated last year
- Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite ☆91 · Updated last year
- SciRepEval benchmark training and evaluation scripts ☆73 · Updated 10 months ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities. ☆16 · Updated 7 months ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax. ☆59 · Updated 11 months ago
- A BERT-based application for reusable text classification at scale ☆38 · Updated last year
- A basic tool that extracts the structure from the PDF files of scientific articles. ☆74 · Updated 3 years ago
- The landscape of biomedical research ☆115 · Updated 11 months ago
- Python API Wrapper for OpenAlex. Query OpenAlex for metadata in Python. ☆19 · Updated 2 years ago
- Tools for interactive visual exploration of semantic embeddings. ☆32 · Updated 7 months ago
- Service for converting and enhancing heterogeneous publisher XML formats into TEI ☆53 · Updated 6 months ago
- ☆67 · Updated last year
- Scripts used to make and evaluate OpenAlex's concept tagging model ☆48 · Updated last year
- Scrollership through 20m pubmed abstracts. ☆26 · Updated last year
- Works-magnet: Retrieve and promote the scholarly works of your institution. ☆21 · Updated last week
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents ☆23 · Updated 2 years ago
- OpenAlex Networks is a helper library to process and obtain data from the OpenAlex dataset via API. It also provides functionality to gen… ☆20 · Updated 2 years ago
- One downloader for many scientific data and code repositories! DOI Data ☆71 · Updated this week
- PhD Dissertation "Automated Extraction and Curation of Materials Information from Scientific Literature" ☆9 · Updated last year
- An easy way to chunk spaCy docs. ☆19 · Updated 7 months ago
- A spaCy wrapper for GliNER ☆112 · Updated 2 months ago
- Keeping It Simple is Hard ☆10 · Updated last year
- ☆16 · Updated 8 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 · Updated last year
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆94 · Updated 2 years ago