vhyza / lemmagen-lexiconsLinks
Language lexicons for elasticsearch https://github.com/vhyza/elasticsearch-analysis-lemmagen plugin
☆13Updated 6 years ago
Alternatives and similar repositories for lemmagen-lexicons
Users that are interested in lemmagen-lexicons are comparing it to the libraries listed below
Sorting:
- Elasticsearch lemmatizer for 15 languages☆106Updated 6 months ago
- Language detection extension for spaCy 2.0+☆113Updated 6 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 10 months ago
- An Elasticsearch ingest processor to do named entity extraction using Apache OpenNLP☆272Updated 2 years ago
- Polish morphological tagger.☆43Updated 2 years ago
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year
- Anonymization of legal cases (Fr) based on Flair embeddings☆88Updated 4 years ago
- Updates to Zope's keyphrase extractor (forked from 1.1.0)☆67Updated 8 years ago
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆75Updated 3 weeks ago
- A lexicon to be used for sentiment analysis in Greek.☆35Updated 5 years ago
- Elasticsearch/Solr Sandbox for exploring explain information and tweaking☆137Updated last year
- Text classification using Naive Bayes and Elasticsearch☆154Updated 8 years ago
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine☆186Updated 4 years ago
- natural language processing on german texts☆16Updated 7 years ago
- Spacy model trained based on Norwegian corpus converted from OBT to Universal dep.☆13Updated 7 years ago
- A Python port of the Apache Lucene ASCII Folding Filter that converts alphabetic, numeric, and symbolic Unicode characters which are not …☆15Updated 5 years ago
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German☆26Updated 4 years ago
- An exercise in unsupervised machine learning: Extract Article's Text in HTml documents.☆432Updated last year
- Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary.☆193Updated last year
- Solr Query Segmenter for structuring unstructured queries☆22Updated 4 years ago
- Annotated data set consisting of user comments posted to a German-language newspaper website☆17Updated 6 years ago
- spaCy + UDPipe☆161Updated 3 years ago
- small Java library for splitting German compound words☆63Updated last year
- Custom French POS and lemmatizer based on Lefff for spacy☆66Updated 2 years ago
- Python port for IWNLP.Lemmatizer☆17Updated last year
- Instructions & code for the EuroPython 2014 training session "Topic Modeling for Fun and Profit"☆110Updated 10 years ago
- Python package for harvesting records from OAI-PMH provider(s).☆63Updated 2 years ago
- Named entity extraction from Portuguese web text☆71Updated 7 years ago
- A lemmatizer for German language text☆91Updated 2 years ago
- ☆184Updated 6 years ago