devmount / GermanWordEmbeddings
Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tensorflow.
☆236Updated 6 months ago
Alternatives and similar repositories for GermanWordEmbeddings:
Users that are interested in GermanWordEmbeddings are comparing it to the libraries listed below
- Compound splitter for German☆104Updated 4 years ago
- Named Entity Recognition data for Europeana Newspapers☆171Updated last year
- A lemmatizer for German language text☆87Updated 2 years ago
- German Morphological Analyzer☆47Updated 3 years ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre…☆83Updated 3 years ago
- Ten Thousand German News Articles Dataset for Topic Classification☆84Updated 2 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆138Updated 2 months ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆465Updated 3 months ago
- An unsupervised compound splitter☆41Updated 5 years ago
- 💫 Scripts, tools and resources for developing spaCy☆125Updated 5 years ago
- Making sense embedding out of word embeddings using graph-based word sense induction☆212Updated 3 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆180Updated last year
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German☆26Updated 3 years ago
- German sentiment scores with SentiWS as extension for spaCy☆36Updated 2 years ago
- Various utilities for processing the data.☆207Updated this week
- 🆕 Work continues on INCEpTION 👉 https://github.com/inception-project/inception 👈 -- ⚠️ The official WebAnno repository has reached the…☆244Updated 2 years ago
- spaCy + UDPipe☆160Updated 2 years ago
- The Hanover Tagger - A simple approach to lemmatization and POS-tagging of German morphology based on heuristics and hidden markov models…☆51Updated last year
- NLP French language model implementing ULMFiT☆87Updated 5 years ago
- Retrofitting Word Vectors to Semantic Lexicons☆375Updated 5 years ago
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- Language independent truecaser in Python.☆160Updated 3 years ago
- Dutch word embeddings, trained on a large collection of Dutch social media messages and news/blog/forum posts.☆44Updated 2 years ago
- Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.☆197Updated 3 months ago
- 💫 REST microservices for various spaCy-related tasks☆240Updated 2 years ago
- Code and data for inducing domain-specific sentiment lexicons.☆195Updated 6 months ago
- spaCy REST API, wrapped in a Docker container.☆266Updated 2 years ago
- Automatically exported from code.google.com/p/universal-pos-tags☆129Updated 2 years ago
- Various Algorithms for Short Text Mining☆466Updated this week
- Universal Dependencies online documentation☆281Updated this week