hdaSprachtechnologie / odenet
Open German WordNet
☆88Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for odenet
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri…☆22Updated 2 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆135Updated 3 months ago
- ☆63Updated 5 months ago
- German Morphological Analyzer☆47Updated 2 years ago
- Compiled tools, datasets, and other resources for historical text normalization.☆16Updated 5 years ago
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German☆26Updated 3 years ago
- This packages up data for the Open Multilingual Wordnet☆43Updated last week
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format.☆54Updated this week
- GermaParl: Corpus of Plenary Protocols of the German Bundestag (TEI Format)☆30Updated last year
- The Open Multilingual Wordnet☆58Updated 6 months ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆22Updated 2 years ago
- GermaNet API for Python☆53Updated 6 years ago
- A lemmatizer for German language text☆87Updated last year
- Ten Thousand German News Articles Dataset for Topic Classification☆84Updated 2 years ago
- A tool for automatic spelling normalization☆20Updated 3 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆155Updated last year
- Python for Linguists – a Gentle Introduction to Programming☆44Updated 8 years ago
- Python framework for processing Universal Dependencies data☆56Updated last week
- UIMA CAS processing library written in Python☆85Updated 6 months ago
- Python version for Doug Biber's Multidimensional Analysis (MDA)☆27Updated 4 months ago
- CONLL-U to Pandas DataFrame☆31Updated 6 years ago
- LoanPy is a linguistic toolkit for rule-based prediction and evaluation of loanword adaptation and historical reconstructions and can be …☆15Updated 8 months ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆76Updated 4 months ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆149Updated last year
- A character-wise tokenizer for morphologically rich languages☆27Updated 4 months ago
- Various utilities for processing the data.☆205Updated this week
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆124Updated 3 years ago
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars☆15Updated 4 months ago
- Stemmer for German☆45Updated 2 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆36Updated 2 years ago