vhyza / lemmagen-lexiconsLinks
Language lexicons for elasticsearch https://github.com/vhyza/elasticsearch-analysis-lemmagen plugin
☆13Updated 6 years ago
Alternatives and similar repositories for lemmagen-lexicons
Users that are interested in lemmagen-lexicons are comparing it to the libraries listed below
Sorting:
- Elasticsearch lemmatizer for 15 languages☆108Updated 8 months ago
- Detect and visualize text reuse☆118Updated last year
- A lemmatizer for German language text☆91Updated 2 years ago
- Python package for harvesting records from OAI-PMH provider(s).☆64Updated 3 years ago
- spaCy REST API, wrapped in a Docker container.☆267Updated 2 years ago
- Hunspell extension for spaCy 2.0.☆94Updated last year
- A simple Python library/tool for pulling location information from unstructured text☆186Updated 14 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 4 years ago
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German☆26Updated 4 years ago
- This repo provides a python module to work with Open Dutch WordNet. It was created using python 3.4.☆67Updated 4 years ago
- A machine learning tool for fishing entities☆265Updated 3 months ago
- German lemmatization with IWNLP as extension for spaCy☆24Updated 2 years ago
- Multi Tier Annotation Search☆26Updated 4 years ago
- Language detection extension for spaCy 2.0+☆113Updated 6 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 6 years ago
- Python port for IWNLP.Lemmatizer☆17Updated last year
- Norwegian Named Entities annotations on top of NDT (Norwegian Dependency Treebank)☆69Updated 11 months ago
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆76Updated 3 months ago
- Extract countries, regions and cities from a URL or text☆217Updated 4 years ago
- Python stemming library using snowball stemmers☆263Updated 2 weeks ago
- Hidden alignment conditional random field for classifying string pairs.☆36Updated 7 years ago
- UIMA CAS processing library written in Python☆90Updated 2 months ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆147Updated 8 months ago
- Spacy model trained based on Norwegian corpus converted from OBT to Universal dep.☆13Updated 7 years ago
- The oaipmh module is a Python implementation of an "Open Archives$ Initiative Protocol for Metadata Harvesting"☆87Updated 2 years ago
- small Java library for splitting German compound words☆63Updated last year
- Text tokenization and sentence segmentation (segtok v2)☆205Updated 3 years ago
- Custom French POS and lemmatizer based on Lefff for spacy☆68Updated 2 years ago
- Open morphology for Finnish☆92Updated last week
- Named entity extraction from Portuguese web text☆71Updated 8 years ago