wartaal / HanTa
The Hanover Tagger - A simple approach to lemmatization and POS-tagging of German morphology based on heuristics and hidden Markov models
☆51 · Updated last year
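To illustrate the hidden Markov model idea named in the description, here is a minimal, self-contained sketch of Viterbi decoding for POS tagging. It is not HanTa's code or API: the tag set, the probability tables (`START`, `TRANS`, `EMIT`), the `UNSEEN` smoothing constant, and the `viterbi` function are all hypothetical, hand-picked values chosen only for the example.

```python
import math

# Toy model: every number below is an assumption made for illustration,
# not a value taken from HanTa or from any trained model.
STATES = ["DET", "NOUN", "VERB"]
START = {"DET": 0.6, "NOUN": 0.3, "VERB": 0.1}
TRANS = {
    "DET": {"DET": 0.05, "NOUN": 0.9, "VERB": 0.05},
    "NOUN": {"DET": 0.1, "NOUN": 0.2, "VERB": 0.7},
    "VERB": {"DET": 0.5, "NOUN": 0.3, "VERB": 0.2},
}
EMIT = {
    "DET": {"der": 0.5, "die": 0.5},
    "NOUN": {"hund": 0.5, "katze": 0.5},
    "VERB": {"sieht": 0.6, "jagt": 0.4},
}
UNSEEN = 1e-6  # crude stand-in probability for words a tag never emitted


def viterbi(words):
    """Return the most probable tag sequence for `words` under the toy HMM."""
    words = [w.lower() for w in words]
    if not words:
        return []

    def emit(state, word):
        return math.log(EMIT[state].get(word, UNSEEN))

    # scores[i][s]: best log-probability of a tag path ending in s at position i
    scores = [{s: math.log(START[s]) + emit(s, words[0]) for s in STATES}]
    back = []  # back[i][s]: best predecessor of s at position i + 1
    for word in words[1:]:
        prev = scores[-1]
        cur, ptr = {}, {}
        for s in STATES:
            best = max(STATES, key=lambda p: prev[p] + math.log(TRANS[p][s]))
            cur[s] = prev[best] + math.log(TRANS[best][s]) + emit(s, word)
            ptr[s] = best
        scores.append(cur)
        back.append(ptr)

    # Follow the back-pointers from the best final state.
    path = [max(STATES, key=lambda s: scores[-1][s])]
    for ptr in reversed(back):
        path.append(ptr[path[-1]])
    path.reverse()
    return path


print(viterbi(["Der", "Hund", "jagt", "die", "Katze"]))
```

A real tagger would estimate these tables from an annotated corpus and would handle unknown words with something better than a constant, for example the morphological heuristics the description mentions.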
Alternatives and similar repositories for HanTa:
Users interested in HanTa compare it to the libraries listed below.
- A lemmatizer for German language text ☆87 · Updated 2 years ago
- A data set and model for German sentiment classification. ☆66 · Updated 6 months ago
- Ten Thousand German News Articles Dataset for Topic Classification ☆84 · Updated 2 years ago
- German sentiment scores with SentiWS as extension for spaCy ☆36 · Updated 2 years ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German ☆465 · Updated 3 months ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… ☆46 · Updated 10 months ago
- BERT and ELECTRA models trained on Europeana Newspapers ☆37 · Updated 3 years ago
- Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tenso… ☆236 · Updated 6 months ago
- GermaParl: Corpus of Plenary Protocols of the German Bundestag (TEI Format) ☆31 · Updated last year
- Sentence transformers models for spaCy ☆107 · Updated last year
- Plan and train German transformer models. ☆23 · Updated 3 years ago
- German language support for TextBlob. ☆103 · Updated last month
- A Dataset of German Legal Documents for Named Entity Recognition ☆165 · Updated 2 years ago
- German lemmatization with IWNLP as extension for spaCy ☆24 · Updated last year
- Linguistic and stylistic complexity measures for (literary) texts ☆79 · Updated last year
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆93 · Updated last year
- A tokenizer and sentence splitter for German and English web and social media texts. ☆138 · Updated 2 months ago
- UIMA CAS processing library written in Python ☆86 · Updated 9 months ago
- This repository contains all manually labeled data from the GermEval-2018 shared task. ☆30 · Updated 6 years ago
- Compound splitter for German ☆104 · Updated 4 years ago
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German ☆26 · Updated 3 years ago
- Stemmer for German ☆45 · Updated 2 years ago
- DaCy: The State of the Art Danish NLP pipeline using spaCy ☆95 · Updated last month
- Fuzzy matching and more functionality for spaCy. ☆254 · Updated 7 months ago
- Information extraction from English and German texts based on predicate logic ☆135 · Updated last year
- Dataframe Integration with spaCy. ☆103 · Updated 3 years ago
- German Morphological Analyzer ☆47 · Updated 3 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆157 · Updated 2 years ago
- Annotation Management for Prodigy that supports multiple users working in many projects ☆15 · Updated 6 years ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s… ☆212 · Updated last month