wartaal / HanTa
The Hanover Tagger - A simple approach to lemmatization and POS-tagging of German morphology based on heuristics and hidden markov models
☆47Updated last year
Related projects: ⓘ
- A lemmatizer for German language text☆87Updated last year
- German sentiment scores with SentiWS as extension for spaCy☆36Updated last year
- German language support for TextBlob.☆103Updated 3 years ago
- Ten Thousand German News Articles Dataset for Topic Classification☆81Updated last year
- A tokenizer and sentence splitter for German and English web and social media texts.☆135Updated last month
- ☆18Updated 3 weeks ago
- A data set and model for german sentiment classification.☆61Updated last month
- UIMA CAS processing library written in Python☆84Updated 4 months ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆87Updated 3 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆35Updated 2 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆22Updated last year
- German lemmatization with IWNLP as extension for spaCy☆23Updated last year
- Legal Reference Extraction☆26Updated last month
- spaCy + UDPipe☆159Updated 2 years ago
- Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tenso…☆234Updated 3 weeks ago
- Compound splitter for German☆102Updated 4 years ago
- This is a german ELMo deep contextualized word representation. It is trained on a special German Wikipedia Text Corpus.☆28Updated 4 years ago
- Fuzzy matching and more functionality for spaCy.☆249Updated 2 months ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆90Updated last year
- Dataframe Integration with spaCy.☆100Updated 3 years ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆446Updated 3 weeks ago
- 📂 Additional lookup tables and data resources for spaCy☆98Updated last year
- German Morphological Analyzer☆45Updated 2 years ago
- Language detection using Spacy and Fasttext☆53Updated 9 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆151Updated last year
- Library for unit extraction - fork of quantulum for python3☆134Updated 2 months ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆72Updated 2 months ago
- Language Models for Zalando's flair library☆62Updated 4 years ago
- Python port for IWNLP.Lemmatizer☆17Updated 11 months ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆153Updated last year