Tutanchamon / pl_stemmerLinks
A very simple python stemmer for Polish language based on Porter's Algorithm
☆20Updated 7 years ago
Alternatives and similar repositories for pl_stemmer
Users that are interested in pl_stemmer are comparing it to the libraries listed below
Sorting:
- ☆18Updated 9 years ago
- Resources for doing NLP in Polish☆47Updated 5 years ago
- Polish morphological tagger.☆43Updated 2 years ago
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- COMBO is jointly trained tagger, lemmatizer and dependency parser.☆35Updated 2 years ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs."☆26Updated 2 years ago
- Shared BERT model for 4 languages of Bulgarian, Czech, Polish and Russian. Slavic NER model.☆76Updated 3 years ago
- CoreNLG is an easy to use and productivity oriented Python library for Natural Language Generation. It aims to provide the essential tool…☆27Updated 3 years ago
- ☆50Updated 2 years ago
- Generic framework for information extraction tasks, including recognition of named entities, temporal expressions, spatial expressions an…☆12Updated 2 years ago
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- No Teacher BART distillation experiment for NLI tasks☆27Updated 4 years ago
- ☆18Updated 7 years ago
- Polish BERT☆70Updated 4 years ago
- Extension of scikit-learn TfidfVectorizer and CountVectorizer that allows for online learning / partial fit.☆34Updated 7 years ago
- Dataframe Integration with spaCy.☆103Updated 4 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Jupyter Widget for data annotation☆139Updated 2 years ago
- Running Prodigy for a team of annotators☆53Updated 4 years ago
- Example using Polyaxon to experiment with pre-training spaCy☆65Updated 3 years ago
- Tool for named entity recognition for Polish based on deep learning.☆31Updated 2 years ago
- Context sensitive spell checker for Icelandic based on a recurrent neural network model from karpathy/char-rnn. This repo is no longer in…☆40Updated 9 years ago
- Custom Natural Language Processing with big and small models 🌲🌱☆68Updated 3 years ago
- KenLM extension for spaCy 2.0.☆16Updated 7 years ago
- Generic Environment for Context-Aware Correction of Orthography☆22Updated 2 years ago
- spaCy pipeline component for adding text readability meta data to Doc objects.☆56Updated 6 years ago
- Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary.☆192Updated last year
- A fully customisable language detection pipeline for spaCy☆92Updated 6 years ago
- An alternative approach for probabilistic topic modeling based on agglomerative clustering of topics (not documents)☆12Updated 4 years ago
- Text vectorization tool to outperform TFIDF for classification tasks☆194Updated 11 months ago