dzieciou / lemmatizer-pl
Python lemmatizer for Polish.
☆18Updated 5 years ago
Alternatives and similar repositories for lemmatizer-pl
Users who are interested in lemmatizer-pl are comparing it to the libraries listed below.
- Python port of Stempel, an algorithmic stemmer for the Polish language.☆37Updated 8 months ago
- Polish morphological tagger.☆43Updated last year
- Contextual Lemmatization and Morphological Tagging in 100 different languages. A Participant System for SigMorphon2019 Task 2☆24Updated 9 months ago
- [obsolete] Python interface to Morfeusz☆10Updated 7 years ago
- Featurize words into orthographic and phonological vectors.☆41Updated last year
- A very simple Python stemmer for the Polish language based on Porter's algorithm☆20Updated 7 years ago
- Linguistic data on the Yongning Na language☆7Updated last month
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆112Updated last year
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆28Updated 5 years ago
- An NLP pipeline for Hebrew☆37Updated 2 months ago
- COMBO is a jointly trained tagger, lemmatizer, and dependency parser.☆35Updated 2 years ago
- A Python module for word inflections designed for use with spaCy.☆92Updated 5 years ago
- Cython wrapper on Hunspell Dictionary☆66Updated 10 months ago
- A human-annotated morphosyntactic treebank for Turkish.☆31Updated 2 years ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs."☆26Updated 2 years ago
- Resources for doing NLP in Polish☆47Updated 5 years ago
- ☆18Updated 9 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆81Updated last year
- Compound splitter for German☆105Updated 5 years ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.☆77Updated 3 years ago
- The home repository of the NerKor corpus, a Hungarian gold standard named entity annotated corpus containing 1 million tokens.☆15Updated last year
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre…☆83Updated 3 years ago
- German sentiment scores with SentiWS as extension for spaCy☆37Updated 2 years ago
- ☆25Updated 2 years ago
- spaCy + UDPipe☆161Updated 3 years ago
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆156Updated this week
- ☆16Updated 5 years ago
- A stemming system for the Greek language☆48Updated 3 years ago
- GermaNER: Free Open German Named Entity Recognition Tool☆36Updated last year