spraakbanken / sparv-pipeline
Språkbanken's text analysis tool
☆25Updated this week
Alternatives and similar repositories for sparv-pipeline:
Users that are interested in sparv-pipeline are comparing it to the libraries listed below
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 4 years ago
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, …☆34Updated 5 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Updated last year
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German☆26Updated 3 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated last year
- German Morphological Analyzer☆47Updated 3 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆79Updated last year
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆27Updated 5 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆37Updated 3 years ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpus☆18Updated 8 months ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- CONLL-U to Pandas DataFrame☆31Updated 7 years ago
- UIMA CAS processing library written in Python☆86Updated 8 months ago
- German sentiment scores with SentiWS as extension for spaCy☆36Updated 2 years ago
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- Runnable morphological analysis tools from the UniMorph project☆15Updated 6 years ago
- Backend for Korp, Språkbanken's corpus search tool☆15Updated 4 months ago
- Plan and train German transformer models.☆23Updated 3 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆79Updated 6 months ago
- Digital Humanities Across Borders☆47Updated 10 months ago
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems☆57Updated 9 months ago
- A character-wise tokenizer for morphologically rich languages☆27Updated last month
- Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist☆37Updated 5 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆26Updated 2 years ago
- This is a prototype of a Python module for simple modification of document files.☆17Updated 3 years ago
- Data for the HIPE 2022 shared task.☆16Updated last year
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆38Updated 2 years ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 5 months ago