morfologik / polimorfologik
Scripts for preprocessing morfologik data.
☆40 Updated 7 years ago
Alternatives and similar repositories for polimorfologik
Users who are interested in polimorfologik compare it to the libraries listed below.
- Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary. ☆192 Updated last year
- Polish morphological tagger. ☆43 Updated last year
- ☆18 Updated 9 years ago
- Python port of Stempel, an algorithmic stemmer for the Polish language. ☆37 Updated 8 months ago
- German part-of-speech dictionary ☆45 Updated last year
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary ☆11 Updated last year
- Program used to split text into segments ☆26 Updated 6 months ago
- NER tagger for English, Spanish, Dutch, Italian, German and French. ☆35 Updated 9 years ago
- Python lemmatizer for Polish. ☆18 Updated 5 years ago
- Small Java library for splitting German compound words ☆63 Updated 11 months ago
- The Sweble Wikitext Components module provides a parser for MediaWiki's wikitext and an engine trying to emulate the behavior of a MediaW… ☆71 Updated last year
- Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefi… ☆36 Updated last month
- Stanford Tregex-inspired language for rule-based dependency tree manipulation. ☆21 Updated 8 years ago
- German Morphological Analyzer ☆47 Updated 3 years ago
- Python port for IWNLP.Lemmatizer ☆17 Updated last year
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts ☆81 Updated 6 years ago
- A Python module for interfacing with the TreeTagger by Helmut Schmid. ☆75 Updated 3 years ago
- NLP tools developed by Emory University. ☆60 Updated 8 years ago
- A very simple Python stemmer for the Polish language based on Porter's algorithm ☆20 Updated 7 years ago
- Language detection extension for spaCy 2.0+ ☆112 Updated 6 years ago
- German lemmatization with IWNLP as extension for spaCy ☆24 Updated last year
- Unitex/GramLab Language Resources ☆19 Updated 2 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆51 Updated 3 years ago
- Search back-end for dependency tree search. See the docs at https://fginter.github.io/dep_search/ ☆17 Updated 7 years ago
- Lucene Auto Phrase TokenFilter implementation ☆59 Updated 6 years ago
- Unitex/GramLab C++ Core ☆23 Updated last year
- A language detection Web Service ☆53 Updated 8 years ago
- This is a fact-based Question Answering System using Apache Solr as backend search engine, Wikipedia dumps as information source, Apache … ☆26 Updated 2 years ago
- A fast and comprehensive Java library capable of performing automaton and non-automaton based Levenshtein distance determination and neig… ☆42 Updated 12 years ago
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format. ☆128 Updated last year