bieli / stopwords
Popular stopwords for general languages - very useful for building dictionaries, search engines, or text indexes
☆45Updated 11 years ago
Alternatives and similar repositories for stopwords:
Users who are interested in stopwords are comparing it to the libraries listed below
- A curated list of resources dedicated to Natural Language Processing (NLP) in Polish. Models, tools, datasets.☆297Updated 3 years ago
- Pre-trained models and language resources for Natural Language Processing in Polish☆335Updated 9 months ago
- RoBERTa models for Polish☆86Updated 2 years ago
- Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl web service☆50Updated last month
- Resources for doing NLP in Polish☆47Updated 5 years ago
- ☆29Updated 2 years ago
- ☆50Updated 2 years ago
- HerBERT is a BERT-based Language Model trained on Polish Corpora using only MLM objective with dynamic masking of whole words.☆68Updated 3 years ago
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- Polish RoBERTa model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆34Updated 3 years ago
- A visualization of Warsaw public transport☆90Updated 2 years ago
- ParlaMint: Comparable Parliamentary Corpora☆55Updated 2 weeks ago
- Scripts, tutorials, and a programming knowledge base for working with the Bielik model.☆94Updated last month
- Python port of Stempel, an algorithmic stemmer for the Polish language.☆36Updated 6 months ago
- Python lemmatizer for Polish.☆18Updated 5 years ago
- Code for a series of online lectures☆45Updated 3 years ago
- How to train Word2Vec for your language.☆11Updated 7 years ago
- Spelling corrector in Python☆474Updated 2 months ago
- COVID restrictions generator☆535Updated last year
- Polish morphological tagger.☆43Updated last year
- Dutch word embeddings, trained on a large collection of Dutch social media messages and news/blog/forum posts.☆44Updated 3 years ago
- Unofficial Python library for using the Polish Wordnet (plWordNet / Słowosieć)☆19Updated 2 years ago
- Trained PyTorch models for Polish-language sentiment prediction based on allegro/herbert and CLARIN-PL datasets☆11Updated 3 years ago
- An easy-to-use Python package for deep learning-based German sentiment classification.☆60Updated 2 years ago
- CLASSLA Fork of the Official Stanford NLP Python Library for Many Human Languages☆40Updated 2 weeks ago
- Polish dictionary for IntelliJ☆105Updated 4 years ago
- [GSOC] Greek language support for spacy.io Python NLP software☆100Updated 6 years ago
- competition application☆10Updated 2 years ago
- 📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more☆370Updated 5 months ago
- Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tenso…☆236Updated 6 months ago