LeonieWeissweiler / CISTEM
Stemmer for German
☆45Updated 2 years ago
Alternatives and similar repositories for CISTEM:
Users that are interested in CISTEM are comparing it to the libraries listed below
- Open German WordNet☆89Updated 11 months ago
- German stopwords collection☆86Updated 2 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆137Updated last month
- small Java library for splitting German compound words☆61Updated 8 months ago
- Compound splitter for German☆104Updated 4 years ago
- The Zurich Dependency Parser for German☆82Updated 2 years ago
- ☆18Updated last week
- Plan and train German transformer models.☆23Updated 3 years ago
- A lemmatizer for German language text☆87Updated last year
- German Morphological Analyzer☆47Updated 3 years ago
- NLP framework: sentence detector, tokeniser, pos-tagger and dependency parser☆49Updated last year
- Multi Tier Annotation Search☆26Updated 3 years ago
- GermaNER: Free Open German Named Entity Recognition Tool☆36Updated last year
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German☆26Updated 3 years ago
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year
- An unsupervised compound splitter☆41Updated 5 years ago
- UIMA CAS processing library written in Python☆86Updated 8 months ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri…☆22Updated 2 years ago
- Extended list of German stopwords for use in Web Projects, Search Engines or every thing else.☆101Updated 5 years ago
- German language support for TextBlob.☆105Updated last week
- The Hanover Tagger - A simple approach to lemmatization and POS-tagging of German morphology based on heuristics and hidden markov models…☆49Updated last year
- Ten Thousand German News Articles Dataset for Topic Classification☆84Updated 2 years ago
- A machine learning tool for fishing entities☆253Updated last week
- DKPro JWPL (DKPro Java Wikipedia Library) is a free, Java-based application programming interface that facilitates access to all informat…☆82Updated 2 months ago
- BERT and ELECTRA models trained on Europeana Newspapers☆37Updated 3 years ago
- Multi Tier Annotation Search☆12Updated 8 months ago
- Named Entity Recognition data for Europeana Newspapers☆171Updated last year
- This is a german ELMo deep contextualized word representation. It is trained on a special German Wikipedia Text Corpus.☆28Updated 5 years ago
- Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums.☆210Updated this week
- This packages up data for the Open Multilingual Wordnet☆44Updated this week