sheerun / awesome-polish-nlpLinks
Resources for doing NLP in Polish
☆48Updated 6 years ago
Alternatives and similar repositories for awesome-polish-nlp
Users that are interested in awesome-polish-nlp are comparing it to the libraries listed below
Sorting:
- HerBERT is a BERT-based Language Model trained on Polish Corpora using only MLM objective with dynamic masking of whole words.☆70Updated 3 years ago
- Polish morphological tagger.☆43Updated 2 years ago
- A very simple python stemmer for Polish language based on Porter's Algorithm☆20Updated 8 years ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆329Updated 8 months ago
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!☆477Updated 2 years ago
- RoBERTa models for Polish☆89Updated 3 years ago
- ☆50Updated 3 years ago
- Evaluation of Sentence Representations in Polish☆23Updated 3 years ago
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆35Updated 4 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆183Updated 2 years ago
- spaCy + UDPipe☆165Updated 3 years ago
- Deep learning with text doesn't have to be scary.☆275Updated 3 years ago
- Textpipe: clean and extract metadata from text☆302Updated 4 years ago
- Polish BERT☆72Updated 5 years ago
- Ten Thousand German News Articles Dataset for Topic Classification☆86Updated 3 years ago
- Lightning Fast Language Prediction 🚀☆167Updated 4 months ago
- Information extraction from English and German texts based on predicate logic☆392Updated 3 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models, tools, datasets.☆306Updated 4 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆191Updated 2 years ago
- A compound word splitter for Python☆49Updated 4 years ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.☆79Updated 4 years ago
- Pre-trained models and language resources for Natural Language Processing in Polish☆364Updated last year
- A word2vec negative sampling implementation with correct CBOW update.☆261Updated 4 years ago
- Fuzzy matching and more functionality for spaCy.☆259Updated last year
- Anonymization of legal cases (Fr) based on Flair embeddings☆87Updated 5 years ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs."☆27Updated 3 years ago
- Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tenso…☆240Updated last year
- Dataframe Integration with spaCy.☆103Updated 4 years ago
- Abydos NLP/IR library for Python☆193Updated 3 years ago
- Hunspell extension for spaCy 2.0.☆94Updated last year