ourresearch / openalex-pdf-parser
PDF parser powered by grobid
☆26 Updated 9 months ago
Alternatives and similar repositories for openalex-pdf-parser:
Users who are interested in openalex-pdf-parser are comparing it to the libraries listed below.
- Finding mentions and citations to named and implicit research datasets from within the academic literature ☆24 Updated 6 months ago
- ☆54 Updated last year
- Open Access PDF harvester, metadata aggregator and full-text ingester ☆60 Updated last year
- link raw affiliation to ROR ids ☆30 Updated last year
- Service for converting and enhancing heterogeneous publisher XML formats into TEI ☆54 Updated 7 months ago
- PhD Dissertation "Automated Extraction and Curation of Materials Information from Scientific Literature" ☆9 Updated last year
- A basic tool that extracts the structure from the PDF files of scientific articles. ☆74 Updated 3 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax. ☆59 Updated last year
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities. ☆17 Updated 8 months ago
- ☆31 Updated last year
- A BERT-based application for reusable text classification at scale ☆38 Updated last year
- An easy way to chunk spaCy docs. ☆20 Updated 8 months ago
- SciRepEval benchmark training and evaluation scripts ☆74 Updated 11 months ago
- ☆87 Updated 11 months ago
- Scrollership through 20m pubmed abstracts. ☆26 Updated last year
- A deep learning model for extracting references from text ☆28 Updated last year
- ☆67 Updated last year
- Scientific Document Insight Q/A ☆29 Updated last month
- Compute novelty indicators ☆33 Updated 10 months ago
- A high performance bibliographic information service: https://biblio-glutton.readthedocs.io ☆137 Updated 7 months ago
- Python library for the OpenAlex HTTP API ☆23 Updated 2 years ago
- Keeping It Simple is Hard ☆10 Updated last year
- Python API Wrapper for OpenAlex. Query OpenAlex for metadata in Python. ☆19 Updated 2 years ago
- A python library for the Semantic Scholar (S2) API with typed pydantic objects and various nifty functionalities. ☆21 Updated 4 years ago
- The CleanCoNLL dataset from our EMNLP 2023 paper where we corrected annotation errors and inconsistencies in CoNLL-03. ☆24 Updated 10 months ago
- Discourse Analysis Tool Suite ☆21 Updated this week
- One downloader for many scientific data and code repositories! DOI Data ☆73 Updated last week
- Works-magnet: Retrieve and promote the scholarly works of your institution. ☆22 Updated 2 weeks ago
- Viewer for the structure extracted by Grobid on PDF documents ☆49 Updated this week
- Finds linguistic patterns effortlessly ☆36 Updated last year