sheerun / awesome-polish-nlp
Resources for doing NLP in Polish
☆45Updated 5 years ago
Alternatives and similar repositories for awesome-polish-nlp:
Users that are interested in awesome-polish-nlp are comparing it to the libraries listed below
- ☆50Updated 2 years ago
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models, tools, datasets.☆294Updated 3 years ago
- RoBERTa models for Polish☆86Updated 2 years ago
- HerBERT is a BERT-based Language Model trained on Polish Corpora using only MLM objective with dynamic masking of whole words.☆66Updated 2 years ago
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆34Updated 3 years ago
- Polish morphological tagger.☆42Updated last year
- Pre-trained models and language resources for Natural Language Processing in Polish☆331Updated 7 months ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs."☆26Updated 2 years ago
- A very simple python stemmer for Polish language based on Porter's Algorithm☆20Updated 7 years ago
- Python port of Stempel, an algorithmic stemmer for Polish language.☆35Updated 4 months ago
- Polish BERT☆70Updated 4 years ago
- ☆27Updated 2 years ago
- ☆76Updated last year
- Language detection extension for spaCy 2.0+☆112Updated 5 years ago
- ☆18Updated 9 years ago
- COMBO is jointly trained tagger, lemmatizer and dependency parser.☆36Updated last year
- spaCy + UDPipe☆161Updated 2 years ago
- The weights for the embedding layer of Scandinavian UMLFiT language models☆33Updated 5 years ago
- Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl web service☆50Updated last week
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.☆75Updated 3 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆192Updated last year
- Running Prodigy for a team of annotators☆53Updated 4 years ago
- ULMFiT Method for German Language☆15Updated 5 years ago
- Dataframe Integration with spaCy.☆103Updated 3 years ago
- Compound splitter for German☆104Updated 4 years ago
- Storage and retrieval of Word Embeddings in various databases☆51Updated 6 years ago
- Slides and code examples to my talks☆27Updated last month
- Bag of, not words, but tricks!☆68Updated last year
- Anonymization of legal cases (Fr) based on Flair embeddings☆87Updated 4 years ago