bieli / stopwords
Popular stopwords for general languages - very useful for building dictionaries, search engines, or text indexes
☆45Updated 11 years ago
Alternatives and similar repositories for stopwords
Users that are interested in stopwords are comparing it to the libraries listed below
- Pre-trained models and language resources for Natural Language Processing in Polish☆342Updated last year
- A curated list of resources dedicated to Natural Language Processing (NLP) in Polish. Models, tools, datasets.☆300Updated 3 years ago
- Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl web service☆53Updated 4 months ago
- RoBERTa models for Polish☆87Updated 3 years ago
- Resources for doing NLP in Polish☆47Updated 5 years ago
- ☆50Updated 2 years ago
- ☆30Updated 2 years ago
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- How to train Word2Vec for your language.☆11Updated 7 years ago
- HerBERT is a BERT-based Language Model trained on Polish Corpora using only MLM objective with dynamic masking of whole words.☆67Updated 3 years ago
- Engineering / master's thesis template for the Faculty of Computer Science and Telecommunications at Politechnika Wrocławska (Wrocław University of Science and Technology)☆27Updated 2 years ago
- HuSpaCy: industrial-strength Hungarian natural language processing☆169Updated 8 months ago
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆35Updated 4 years ago
- Python lemmatizer for Polish.☆18Updated 5 years ago
- An easy-to-use Python package for deep learning-based German sentiment classification.☆60Updated 2 years ago
- 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.☆856Updated 10 months ago
- A Python library for calculating a large variety of metrics from text☆340Updated 6 months ago
- Polish morphological tagger.☆43Updated 2 years ago
- ☆31Updated 9 years ago
- R package for stylometric analyses☆192Updated 5 months ago
- Data Science PL knowledge base☆14Updated 5 years ago
- Python port of Stempel, an algorithmic stemmer for Polish language.☆38Updated 9 months ago
- Scripts for preprocessing morfologik data.☆40Updated 7 years ago
- ☆64Updated 3 years ago
- A collection of stop words from around the web☆13Updated 7 years ago
- Calculate text statistics, including syllable counts and Flesch Reading Ease (English and German).☆11Updated 4 years ago
- A very simple Python stemmer for the Polish language based on Porter's Algorithm☆20Updated 7 years ago
- StyloMetrix☆42Updated 10 months ago
- Fixes contractions such as `you're` to `you are`☆319Updated 2 years ago
- Podlasie-dialect aliases for Git commands☆688Updated 3 years ago