Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tensorflow.
☆242Mar 19, 2026Updated this week
Alternatives and similar repositories for GermanWordEmbeddings
Users that are interested in GermanWordEmbeddings are comparing it to the libraries listed below
Sorting:
- A lemmatizer for German language text☆94Feb 7, 2023Updated 3 years ago
- Language Model and Text Classification for German Language using Deep Learning☆18Jun 15, 2018Updated 7 years ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆518Oct 30, 2024Updated last year
- Python port for IWNLP.Lemmatizer☆18Oct 18, 2023Updated 2 years ago
- German sentiment scores with SentiWS as extension for spaCy☆38Nov 26, 2022Updated 3 years ago
- GermaNER: Free Open German Named Entity Recognition Tool☆36Dec 16, 2023Updated 2 years ago
- Ten Thousand German News Articles Dataset for Topic Classification☆87Nov 7, 2022Updated 3 years ago
- Transformer language model (GPT-2) with sentencepiece tokenizer☆10Oct 15, 2019Updated 6 years ago
- The Potsdam Twitter Sentiment Corpus☆18Jan 15, 2020Updated 6 years ago
- GermaParl: Corpus of Plenary Protocols of the German Bundestag (TEI Format)☆38Jun 1, 2023Updated 2 years ago
- German lemmatization with IWNLP as extension for spaCy☆26Jul 28, 2023Updated 2 years ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri…☆35Jul 7, 2022Updated 3 years ago
- GermaNet API for Python☆54Mar 8, 2018Updated 8 years ago
- Coreference resolution for German☆16Jun 26, 2017Updated 8 years ago
- IWNLP: A parser for the German edition of Wiktionary☆13Jul 28, 2023Updated 2 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆159Dec 6, 2022Updated 3 years ago
- Plan and train German transformer models.☆23Feb 22, 2021Updated 5 years ago
- Fine-tuned Transformers compatible BERT models for Sequence Tagging☆40Jul 17, 2020Updated 5 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Apr 25, 2024Updated last year
- Poems retrieval demo built with GNES framework☆14Oct 3, 2019Updated 6 years ago
- German language support for TextBlob.☆102Jan 7, 2025Updated last year
- A Python library for topic modeling and visualization☆67Sep 20, 2020Updated 5 years ago
- This is a prototype of a semi-automatic data anonymization app for German documents. ➡️ The project has moved to: https://gitlab.opencode…☆24Updated this week
- ☆12Jan 27, 2026Updated last month
- A Dataset of German Legal Documents for Named Entity Recognition☆177Oct 19, 2022Updated 3 years ago
- Compound splitter for German☆113Apr 5, 2020Updated 5 years ago
- I analysed online user comments on articles by German news publishers SPON, ZEIT, and Focus☆19Feb 3, 2018Updated 8 years ago
- An R data package containing georeferenced events of right-wing violence in Germany from 2014 onwards☆11Jun 27, 2018Updated 7 years ago
- Automatic Limerick Generation☆11Mar 18, 2021Updated 5 years ago
- An unsupervised compound splitter☆42Oct 6, 2019Updated 6 years ago
- This is a german text corpus from Wikipedia. It is cleaned, preprocessed and sentence splitted. It's purpose is to train NLP embeddings l…☆23Feb 22, 2022Updated 4 years ago
- Post-Specialisation: Retrofitting Vectors of Words Unseen in Lexical Resources☆12Apr 12, 2018Updated 7 years ago
- Presentations & notebooks from our talks /workshops/meetups/etc☆24Mar 23, 2018Updated 7 years ago
- Annotated data set consisting of user comments posted to a German-language newspaper website☆17Jun 28, 2018Updated 7 years ago
- Information extraction from English and German texts based on predicate logic☆393Jul 8, 2022Updated 3 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆153Dec 9, 2024Updated last year
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Mar 8, 2022Updated 4 years ago
- Simple CORPORA list crawler☆10Dec 2, 2016Updated 9 years ago
- German Morphological Analyzer☆52Nov 12, 2021Updated 4 years ago