devmount / GermanWordEmbeddings
Toolkit to obtain and preprocess German text corpora, train models, and evaluate them with generated test sets. Built with Gensim and TensorFlow.
☆236 · Updated 7 months ago
Alternatives and similar repositories for GermanWordEmbeddings:
Users who are interested in GermanWordEmbeddings compare it to the libraries listed below:
- Named Entity Recognition data for Europeana Newspapers ☆171 · Updated 2 years ago
- Compound splitter for German ☆104 · Updated 5 years ago
- A lemmatizer for German language text ☆88 · Updated 2 years ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German ☆475 · Updated 5 months ago
- A tokenizer and sentence splitter for German and English web and social media texts. ☆140 · Updated 4 months ago
- spaCy + UDPipe ☆161 · Updated 2 years ago
- German Morphological Analyzer ☆47 · Updated 3 years ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary. ☆316 · Updated last month
- Ten Thousand German News Articles Dataset for Topic Classification ☆84 · Updated 2 years ago
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German ☆26 · Updated 3 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes ☆181 · Updated last year
- GermaNet API for Python ☆53 · Updated 7 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models ☆156 · Updated 2 years ago
- 🤘 Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪 ☆76 · Updated 3 years ago
- Making sense embedding out of word embeddings using graph-based word sense induction ☆213 · Updated 3 years ago
- ☆18 · Updated 2 weeks ago
- Open German WordNet ☆94 · Updated last year
- Linguistic and stylistic complexity measures for (literary) texts ☆80 · Updated last year
- Language detection extension for spaCy 2.0+ ☆112 · Updated 6 years ago
- NLP framework: sentence detector, tokeniser, pos-tagger and dependency parser ☆49 · Updated last year
- Automatically exported from code.google.com/p/universal-pos-tags ☆129 · Updated 2 years ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy ☆731 · Updated 8 months ago
- The Zurich Dependency Parser for German ☆84 · Updated 2 years ago
- A part-of-speech tagger with support for domain adaptation and external resources. ☆22 · Updated 2 years ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri… ☆24 · Updated 2 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation ☆191 · Updated 4 years ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files ☆379 · Updated 4 months ago
- A Python module for interfacing with the Treetagger by Helmut Schmid. ☆75 · Updated 3 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection ☆73 · Updated 2 years ago
- 💫 REST microservices for various spaCy-related tasks ☆240 · Updated 2 years ago