morfologik / polimorfologik
Scripts for preprocessing morfologik data.
☆39Updated 6 years ago
Related projects
Alternatives and complementary repositories for polimorfologik
- Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary.☆188Updated last year
- Python port of Stempel, an algorithmic stemmer for Polish language.☆35Updated 2 months ago
- The Sweble Wikitext Components module provides a parser for MediaWiki's wikitext and an engine trying to emulate the behavior of a MediaW…☆70Updated 7 months ago
- Polish morphological tagger.☆43Updated last year
- NER tagger for English, Spanish, Dutch, Italian, German, and French.☆35Updated 9 years ago
- Various utilities regarding Levenshtein transducers. (Java)☆56Updated 2 years ago
- KEA - Keyphrase Extraction Algorithm☆21Updated 8 years ago
- Stemmer for German☆45Updated 2 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts☆80Updated 6 years ago
- German part-of-speech dictionary☆43Updated last year
- Elasticsearch Index Termlist☆117Updated 5 years ago
- A language detection Web Service☆53Updated 7 years ago
- A bunch of fancy soft string matching routines, with some accompanying datasets☆55Updated 7 years ago
- Small Java library for splitting German compound words☆62Updated 6 months ago
- A very simple python stemmer for Polish language based on Porter's Algorithm☆20Updated 6 years ago
- Decompounding Plugin for Elasticsearch☆87Updated 3 years ago
- Educational Example of a custom Lucene Query & Scorer☆48Updated 4 years ago
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Updated 5 years ago
- Hy-phen-ation made easy☆202Updated 3 weeks ago
- Baseform lemmatization for Elasticsearch☆26Updated 5 years ago
- JSuffixArrays (Suffix Arrays in Java)☆59Updated 7 years ago
- Python lemmatizer for Polish.☆18Updated 5 years ago
- GermaNER: Free Open German Named Entity Recognition Tool☆36Updated 11 months ago
- A language detection library for the JVM☆36Updated last year
- Elasticsearch lemmatizer for 15 languages☆104Updated 5 months ago
- Polish data.☆11Updated last week
- ☆14Updated 5 years ago
- Resources for doing NLP in Polish☆44Updated 5 years ago
- ☆18Updated 9 years ago
- Snowball compiler and stemming algorithms☆758Updated this week