vhyza / lemmagen-lexicons
Language lexicons for elasticsearch https://github.com/vhyza/elasticsearch-analysis-lemmagen plugin
☆13Updated 6 years ago
Alternatives and similar repositories for lemmagen-lexicons:
Users that are interested in lemmagen-lexicons are comparing it to the libraries listed below
- Polyglot is a language identifier for detecting text documents containing text written in more than one language, and for identifying the…☆33Updated 8 years ago
- Detect and visualize text reuse☆118Updated 5 months ago
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆75Updated 3 years ago
- stop word lists in several languages☆21Updated 7 years ago
- Multi Tier Annotation Search☆26Updated 3 years ago
- French language support for TextBlob.☆59Updated 4 years ago
- Spacy model trained based on Norwegian corpus converted from OBT to Universal dep.☆13Updated 7 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 6 months ago
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German☆26Updated 3 years ago
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year
- GermaNER: Free Open German Named Entity Recognition Tool☆36Updated last year
- This repo provides a python module to work with Open Dutch WordNet. It was created using python 3.4.☆65Updated 3 years ago
- Slides and code examples to my talks☆27Updated 2 months ago
- Polish morphological tagger.☆43Updated last year
- Automatically exported from code.google.com/p/chromium-compact-language-detector☆160Updated 4 years ago
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine☆186Updated 4 years ago
- Guess gender from first name in Python 2 and 3☆132Updated 2 years ago
- ☆28Updated 9 years ago
- Slovak support for Elastic Search (with Dockerfile)☆18Updated 4 years ago
- Named Entity Recognition data for Europeana Newspapers☆171Updated last year
- Annotated data set consisting of user comments posted to a German-language newspaper website☆17Updated 6 years ago
- Geotext extracts country and city mentions from text☆138Updated 2 years ago
- spaCy REST API, wrapped in a Docker container.☆266Updated 2 years ago
- Genderizer is a language independent module which tries to detect gender by looking given first names and/or analyzing sample texts.☆64Updated 10 years ago
- small Java library for splitting German compound words☆62Updated 9 months ago
- Updates to Zope's keyphrase extractor (forked from 1.1.0)☆66Updated 7 years ago
- Language Tool style grammar handling with spaCy 2.0☆42Updated 6 years ago
- Custom French POS and lemmatizer based on Lefff for spacy☆66Updated last year
- A tokenizer and sentence splitter for German and English web and social media texts.☆138Updated 2 months ago