bieli / stopwords
Popular stopwords for general languages - very useful for building dictionaries, searchers or text indexes
☆44 Updated 12 years ago
Alternatives and similar repositories for stopwords
Users who are interested in stopwords are comparing it to the libraries listed below
- A curated list of resources dedicated to Natural Language Processing (NLP) in Polish. Models, tools, datasets. ☆306 Updated 4 years ago
- Pre-trained models and language resources for Natural Language Processing in Polish ☆361 Updated last year
- Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl web service ☆57 Updated 9 months ago
- French word embeddings from series sub-titles ☆22 Updated 7 years ago
- Python lemmatizer for Polish. ☆19 Updated 6 years ago
- Resources for doing NLP in Polish ☆48 Updated 6 years ago
- An easy-to-use Python package for deep learning-based German sentiment classification. ☆58 Updated 3 years ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German ☆507 Updated last year
- ☆30 Updated 3 years ago
- How good is BERT? Comparing BERT to other state-of-the-art approaches on a French sentiment analysis dataset ☆157 Updated 2 years ago
- TensorFlow-based implementation of a deep Siamese LSTM network to capture phrase/sentence similarity using character/word embeddings ☆13 Updated 7 years ago
- RoBERTa models for Polish ☆88 Updated 3 years ago
- Python port of Stempel, an algorithmic stemmer for the Polish language. ☆39 Updated last year
- DaNLP is a repository for Natural Language Processing resources for the Danish Language. ☆207 Updated 9 months ago
- Python 3 wrapper for SentiStrength. SentiStrength is capable of automatic sentiment analysis of up to 16,000 social web texts per second … ☆42 Updated last year
- BERT for Arabic Topic Modeling: An Experimental Study on BERTopic Technique ☆28 Updated 4 years ago
- ☆11 Updated 2 years ago
- French language support for TextBlob. ☆59 Updated 5 years ago
- Spelling corrector in Python ☆489 Updated 4 months ago
- HerBERT is a BERT-based Language Model trained on Polish Corpora using only MLM objective with dynamic masking of whole words. ☆67 Updated 3 years ago
- ☆50 Updated 3 years ago
- French stopwords collection ☆99 Updated 5 years ago
- All languages stopwords collection ☆463 Updated last year
- Context-Sensitive Neural Spelling Checker ☆20 Updated last year
- Python scraper for otodom ☆21 Updated 8 years ago
- HuSpaCy: industrial-strength Hungarian natural language processing ☆173 Updated 3 months ago
- Blazingly fast cleaning of swear words (and their leetspeak) in strings ☆224 Updated last year
- A tokenizer for Icelandic text. ☆29 Updated last week
- Arabic nested named entity recognition ☆42 Updated 8 months ago
- A visualization of Warsaw public transport ☆90 Updated 2 years ago