UniversalDependencies / UD_German-GSD
☆20 Updated last month
Alternatives and similar repositories for UD_German-GSD
Users who are interested in UD_German-GSD are comparing it to the repositories listed below.
- A fully-fledged PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages. ☆24 Updated 2 years ago
- BERT and ELECTRA models trained on Europeana Newspapers ☆38 Updated 4 years ago
- UIMA CAS processing library written in Python ☆90 Updated last month
- Named entity annotation tool ☆28 Updated 2 years ago
- Plan and train German transformer models. ☆23 Updated 4 years ago
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpus ☆19 Updated last year
- OCRopus model for Gothic print (Fraktur) ☆19 Updated 5 years ago
- German lemmatization with IWNLP as extension for spaCy ☆26 Updated 2 years ago
- Multi Tier Annotation Search ☆26 Updated 4 years ago
- A tokenizer and sentence splitter for German and English web and social media texts. ☆150 Updated last year
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at… ☆21 Updated last year
- Ten Thousand German News Articles Dataset for Topic Classification ☆86 Updated 3 years ago
- German Morphological Analyzer ☆51 Updated 4 years ago
- A lemmatizer for German language text ☆94 Updated 2 years ago
- Python framework for processing Universal Dependencies data ☆58 Updated this week
- spaCy + UDPipe ☆164 Updated 3 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆169 Updated 3 years ago
- This repository contains all manually labeled data from the GermEval-2018 shared task. ☆29 Updated 7 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrecht 2019 ☆25 Updated 6 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German" ☆18 Updated 5 years ago
- Text tokenization and sentence segmentation (segtok v2) ☆208 Updated 3 years ago
- Various utilities for processing the data. ☆215 Updated this week
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪 ☆79 Updated 4 years ago
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with… ☆75 Updated 2 months ago
- Norwegian Named Entities annotations on top of NDT (Norwegian Dependency Treebank) ☆71 Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions ☆56 Updated 2 years ago
- Compiled tools, datasets, and other resources for historical text normalization. ☆20 Updated 6 years ago
- A minimal, pure Python library to interface with CoNLL-U format files. ☆153 Updated 2 weeks ago
- 🖋 Resource and Tool for Writing System Identification -- LREC 2024 ☆21 Updated last year
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface ☆261 Updated 3 months ago