Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tensorflow.
☆242Mar 19, 2026Updated last month
Alternatives and similar repositories for GermanWordEmbeddings
Users that are interested in GermanWordEmbeddings are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A lemmatizer for German language text☆95Feb 7, 2023Updated 3 years ago
- Language Model and Text Classification for German Language using Deep Learning☆18Jun 15, 2018Updated 7 years ago
- Any contributions to the NLTK project☆29May 8, 2014Updated 11 years ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆521Oct 30, 2024Updated last year
- Python port for IWNLP.Lemmatizer☆19Apr 13, 2026Updated 3 weeks ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- German sentiment scores with SentiWS as extension for spaCy☆38Apr 13, 2026Updated 3 weeks ago
- GermaNER: Free Open German Named Entity Recognition Tool☆37Dec 16, 2023Updated 2 years ago
- Ten Thousand German News Articles Dataset for Topic Classification☆87Nov 7, 2022Updated 3 years ago
- Transformer language model (GPT-2) with sentencepiece tokenizer☆10Oct 15, 2019Updated 6 years ago
- Parser für die Plenarprotokolle des Bundestags☆21Jul 17, 2017Updated 8 years ago
- The Potsdam Twitter Sentiment Corpus☆18Jan 15, 2020Updated 6 years ago
- GermaParl: Corpus of Plenary Protocols of the German Bundestag (TEI Format)☆38Jun 1, 2023Updated 2 years ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri…☆35Jul 7, 2022Updated 3 years ago
- Developer portfolio website. Small, old-fashioned, but beautiful and lightning fast!☆18Apr 20, 2026Updated 2 weeks ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆158Dec 6, 2022Updated 3 years ago
- Plan and train German transformer models.☆23Feb 22, 2021Updated 5 years ago
- Fine-tuned Transformers compatible BERT models for Sequence Tagging☆40Jul 17, 2020Updated 5 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python. ➡️ The project has moved to: https://gitlab.opencode…☆21Mar 20, 2026Updated last month
- A list of ~100,000 German nouns and their grammatical properties compiled from WiktionaryDE as CSV file. Plus a module to look up the dat…☆168Dec 29, 2024Updated last year
- Poems retrieval demo built with GNES framework☆14Oct 3, 2019Updated 6 years ago
- Testing different approaches to improve PHP script performance☆55Aug 17, 2023Updated 2 years ago
- German language support for TextBlob.☆103Jan 7, 2025Updated last year
- This is a prototype of a semi-automatic data anonymization app for German documents. ➡️ The project has moved to: https://gitlab.opencode…☆24Mar 20, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆12Jan 27, 2026Updated 3 months ago
- A Dataset of German Legal Documents for Named Entity Recognition☆177Oct 19, 2022Updated 3 years ago
- Compound splitter for German☆113Apr 5, 2020Updated 6 years ago
- I analysed online user comments on articles by German news publishers SPON, ZEIT, and Focus☆19Feb 3, 2018Updated 8 years ago
- An R data package containing georeferenced events of right-wing violence in Germany from 2014 onwards☆11Jun 27, 2018Updated 7 years ago
- Automatic Limerick Generation☆11Mar 18, 2021Updated 5 years ago
- German GPT-2 model☆32Aug 17, 2021Updated 4 years ago
- An unsupervised compound splitter☆42Oct 6, 2019Updated 6 years ago
- [ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction☆13Apr 21, 2020Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This is a german text corpus from Wikipedia. It is cleaned, preprocessed and sentence splitted. It's purpose is to train NLP embeddings l…☆23Feb 22, 2022Updated 4 years ago
- Post-Specialisation: Retrofitting Vectors of Words Unseen in Lexical Resources☆12Apr 12, 2018Updated 8 years ago
- Annotated data set consisting of user comments posted to a German-language newspaper website☆17Jun 28, 2018Updated 7 years ago
- Presentations & notebooks from our talks /workshops/meetups/etc☆24Mar 23, 2018Updated 8 years ago
- Watset: Automatic Induction of Synsets from a Graph of Synonyms☆16Jul 7, 2019Updated 6 years ago
- Information extraction from English and German texts based on predicate logic☆393Jul 8, 2022Updated 3 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆13Aug 10, 2023Updated 2 years ago