hdaSprachtechnologie / odenet
Open German WordNet
☆94 Updated last year
Alternatives and similar repositories for odenet:
Users who are interested in odenet are comparing it to the libraries listed below.
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri… ☆25 Updated 2 years ago
- A tokenizer and sentence splitter for German and English web and social media texts. ☆140 Updated 4 months ago
- A lemmatizer for German language text ☆88 Updated 2 years ago
- UIMA CAS processing library written in Python ☆88 Updated last month
- Compound splitter for German ☆104 Updated 5 years ago
- Python wrapper for the CWB to extract concordances and score frequency lists ☆21 Updated last month
- A part-of-speech tagger with support for domain adaptation and external resources. ☆22 Updated 2 years ago
- The Open Multilingual Wordnet ☆61 Updated 11 months ago
- German Morphological Analyzer ☆47 Updated 3 years ago
- Python version for Doug Biber's Multidimensional Analysis (MDA) ☆30 Updated last month
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German ☆26 Updated 3 years ago
- Ten Thousand German News Articles Dataset for Topic Classification ☆84 Updated 2 years ago
- Compiled tools, datasets, and other resources for historical text normalization. ☆18 Updated 5 years ago
- GermaParl: Corpus of Plenary Protocols of the German Bundestag (TEI Format) ☆34 Updated last year
- A fully-fledged PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages. ☆23 Updated last year
- Plan and train German transformer models. ☆23 Updated 4 years ago
- A simple toolkit for conducting analyses using corpus methods ☆25 Updated 3 years ago
- This packages up data for the Open Multilingual Wordnet ☆48 Updated this week
- ☆64 Updated 11 months ago
- GermaNet API for Python ☆53 Updated 7 years ago
- A tool for automatic spelling normalization ☆20 Updated 4 years ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files ☆377 Updated 5 months ago
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format. ☆56 Updated last week
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpus ☆18 Updated 11 months ago
- A minimal, pure Python library to interface with CoNLL-U format files. ☆151 Updated last year
- BERT and ELECTRA models trained on Europeana Newspapers ☆38 Updated 3 years ago
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities. ☆126 Updated 3 years ago
- German lemmatization with IWNLP as extension for spaCy ☆24 Updated last year
- spaCy + UDPipe ☆161 Updated 3 years ago
- Official releases of the PROIEL treebank of ancient Indo-European languages ☆36 Updated last year