bieli / stopwords
Popular stopwords for general languages, very useful for building dictionaries, search engines, or text indexes
☆45 Updated 11 years ago
Alternatives and similar repositories for stopwords
Users who are interested in stopwords compare it to the libraries listed below
- A curated list of resources dedicated to Natural Language Processing (NLP) in Polish. Models, tools, datasets. ☆302 Updated 4 years ago
- Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl web service ☆55 Updated 6 months ago
- How good is BERT? Comparing BERT to other state-of-the-art approaches on a French sentiment analysis dataset ☆157 Updated 2 years ago
- HuSpaCy: industrial-strength Hungarian natural language processing ☆170 Updated last week
- Resources for doing NLP in Polish ☆47 Updated 5 years ago
- ☆50 Updated 2 years ago
- Pre-trained models and language resources for Natural Language Processing in Polish ☆347 Updated last year
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German ☆492 Updated 9 months ago
- ☆30 Updated 2 years ago
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python ☆272 Updated 2 years ago
- Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tenso… ☆239 Updated 11 months ago
- HerBERT is a BERT-based Language Model trained on Polish Corpora using only MLM objective with dynamic masking of whole words. ☆67 Updated 3 years ago
- All languages stopwords collection ☆451 Updated last year
- French word embeddings from series sub-titles ☆22 Updated 6 years ago
- Named Entity Recognition for Danish ☆17 Updated 6 years ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/ ☆753 Updated 3 weeks ago
- Polish morphological tagger. ☆44 Updated 2 years ago
- French stopwords collection ☆97 Updated 5 years ago
- RoBERTa models for Polish ☆87 Updated 3 years ago
- DaNLP is a repository for Natural Language Processing resources for the Danish Language. ☆206 Updated 6 months ago
- Ten Thousand German News Articles Dataset for Topic Classification ☆84 Updated 2 years ago
- Open source Emoticons and Emoji detection library: emot ☆193 Updated last year
- A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks ☆604 Updated last year
- A lemmatizer for German language text ☆91 Updated 2 years ago
- Norwegian NLP Resources ☆181 Updated 4 years ago
- Open German WordNet ☆96 Updated last year
- Train Spacy ner with custom dataset ☆183 Updated 2 years ago
- French language support for TextBlob. ☆59 Updated 5 years ago
- Pre-trained Nordic models for BERT ☆174 Updated 3 years ago
- A stemming system for the Greek language ☆50 Updated 3 years ago