BeelGroup / GIANT-The-1-Billion-Annotated-Synthetic-Bibliographic-Reference-String-Dataset
A script to generate tagged XML citation strings for citation parsing
☆18Updated 4 years ago
Related projects:
- Corpus of Open Access articles from multiple fields in Science, Technology, and Medicine.☆72Updated 7 years ago
- Citation Classification using hybrid neural network model for Wikipedia References☆26Updated last year
- Neuralized version of the Reference String Parser component of the ParsCit package.☆78Updated 2 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 3 years ago
- Workshop materials for our DH2018 workshop on word vectors. Created by Eun Seo Jo, Javier de la Rosa, and Scott Bailey☆15Updated 6 years ago
- A Python library for topic modeling and visualization☆64Updated 3 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 2 years ago
- Service for converting and enhancing heterogeneous publisher XML formats into TEI☆42Updated this week
- Digital Humanities Across Borders☆46Updated 5 months ago
- Python library for the OpenAlex HTTP API☆21Updated last year
- A Named-Entity Recogniser based on Grobid.☆48Updated this week
- Detect and align similar passages☆86Updated 2 weeks ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆26Updated 2 years ago
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch☆67Updated 2 years ago
- A fully-fledged PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆22Updated 10 months ago
- Finding mentions and citations to named and implicit research datasets from within the academic literature☆17Updated this week
- Softcite software mention recognizer, finding mentions and citations to software from within the academic literature☆65Updated this week
- Linguistic and stylistic complexity measures for (literary) texts☆76Updated 7 months ago
- UIMA CAS processing library written in Python☆84Updated 4 months ago
- Python tools for interacting with Wikidata☆139Updated 10 months ago
- An R package for analysis of dramatic texts☆15Updated last year
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆90Updated last year
- Repository for NAACL 2019 paper on Citation Intent prediction☆112Updated 4 years ago
- Explore your own text collection with a topic model – without prior knowledge.☆62Updated last week
- A high performance bibliographic information service: https://biblio-glutton.readthedocs.io☆124Updated this week
- Tutorial on NE processing for Digital Humanities - DH Utrecht 2019☆25Updated 5 years ago
- Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup☆67Updated 3 years ago