vhyza / lemmagen-lexicons
Language lexicons for elasticsearch https://github.com/vhyza/elasticsearch-analysis-lemmagen plugin
☆13Updated 6 years ago
Alternatives and similar repositories for lemmagen-lexicons:
Users that are interested in lemmagen-lexicons are comparing it to the libraries listed below
- Elasticsearch lemmatizer for 15 languages☆105Updated 3 months ago
- Hunspell extension for spaCy 2.0.☆94Updated 8 months ago
- Data files of German Decompounder for Apache Lucene / Apache Solr / Elasticsearch☆105Updated 3 years ago
- 💫 REST microservices for various spaCy-related tasks☆240Updated 2 years ago
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- Polish morphological tagger.☆43Updated last year
- Anonymization of legal cases (Fr) based on Flair embeddings☆88Updated 4 years ago
- Solr Query Segmenter for structuring unstructured queries☆21Updated 3 years ago
- Decompounding Plugin for Elasticsearch☆87Updated 4 years ago
- Python bindings to the Compact Language Detector☆33Updated 4 years ago
- This repo provides a python module to work with Open Dutch WordNet. It was created using python 3.4.☆66Updated 3 years ago
- Examples of Solr configuration entries for Solr plugins and Conceptual Search\Semantic Search from Simon Hughes Dice.com☆26Updated 8 years ago
- Custom French POS and lemmatizer based on Lefff for spacy☆66Updated 2 years ago
- Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary.☆189Updated last year
- Genderizer is a language independent module which tries to detect gender by looking given first names and/or analyzing sample texts.☆65Updated 10 years ago
- Elasticsearch proxy for Quepid.☆13Updated last month
- Updates to Zope's keyphrase extractor (forked from 1.1.0)☆67Updated 7 years ago
- An introduction to using spaCy for NLP and machine learning☆191Updated 3 years ago
- Automatically exported from code.google.com/p/chromium-compact-language-detector☆161Updated 4 years ago
- Spacy model trained based on Norwegian corpus converted from OBT to Universal dep.☆13Updated 7 years ago
- A fully customisable language detection pipeline for spaCy☆92Updated 5 years ago
- A lemmatizer for German language text☆88Updated 2 years ago
- [experiment] CRF-based disambiguation engine for pymorphy2☆10Updated 8 years ago
- A compound word splitter for Python☆48Updated 3 years ago
- Annotation Management for Prodigy, that support multiple users working in many projects☆15Updated 6 years ago
- stop word lists in several languages☆21Updated 8 years ago
- Library for unit extraction - fork of quantulum for python3☆137Updated 9 months ago
- Named Entity Recognition data for Europeana Newspapers☆171Updated last year
- Named Entity Recognition based on dictionaries☆242Updated 6 years ago
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year