kwrobel-nlp / krnnt
Polish morphological tagger.
☆42Updated last year
Alternatives and similar repositories for krnnt:
Users who are interested in krnnt are comparing it to the libraries listed below:
- Text tokenization and sentence segmentation (segtok v2)☆203Updated 2 years ago
- COMBO is a jointly trained tagger, lemmatizer and dependency parser.☆36Updated last year
- spaCy + UDPipe☆161Updated 2 years ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs."☆26Updated 2 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 5 months ago
- RoBERTa models for Polish☆86Updated 2 years ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆149Updated last year
- General-Purpose Neural Networks for Sentence Boundary Detection☆73Updated last year
- ☆64Updated last year
- Polish data.☆11Updated 2 months ago
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpus☆18Updated 7 months ago
- Language detection extension for spaCy 2.0+☆112Updated 5 years ago
- Named Entity Recognition data for Europeana Newspapers☆171Updated last year
- This is a German text corpus from Wikipedia. It is cleaned, preprocessed and sentence split. Its purpose is to train NLP embeddings l…☆22Updated 2 years ago
- Automatic extraction of edited sentences from text edition histories.☆82Updated 2 years ago
- 📂 Additional lookup tables and data resources for spaCy☆99Updated last year
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆151Updated 2 months ago
- A spaCy custom component that extracts and normalizes temporal expressions☆52Updated last year
- German Morphological Analyzer☆47Updated 3 years ago
- Sentence transformers models for SpaCy☆107Updated last year
- CONLL-U to Pandas DataFrame☆31Updated 7 years ago
- Shared BERT model for four languages: Bulgarian, Czech, Polish and Russian. Slavic NER model.☆73Updated 2 years ago
- Text to sentence splitter using a heuristic algorithm by Philipp Koehn and Josh Schroeder.☆234Updated 2 years ago
- Compound splitter for German☆104Updated 4 years ago
- ☆50Updated 2 years ago
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- LASER multilingual sentence embeddings as a pip package☆224Updated last year
- Tool for named entity recognition for Polish based on deep learning.☆30Updated last year
- Linguistic and stylistic complexity measures for (literary) texts☆79Updated 11 months ago
- Generic framework for information extraction tasks, including recognition of named entities, temporal expressions, spatial expressions an…☆12Updated last year