bieli / stopwordsLinks
Popular stopwords for general languages - very usefull for building dictionaries, searchers or text indexes
☆45Updated 12 years ago
Alternatives and similar repositories for stopwords
Users that are interested in stopwords are comparing it to the libraries listed below
Sorting:
- Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl web service☆57Updated 8 months ago
- A curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models, tools, datasets.☆304Updated 4 years ago
- Resources for doing NLP in Polish☆47Updated 5 years ago
- How good is BERT ? Comparing BERT to other state-of-the-art approaches on a French sentiment analysis dataset☆156Updated 2 years ago
- Pre-trained models and language resources for Natural Language Processing in Polish☆357Updated last year
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆502Updated 11 months ago
- Polish morphological tagger.☆43Updated 2 years ago
- ☆30Updated 2 years ago
- HuSpaCy: industrial-strength Hungarian natural language processing☆172Updated 2 months ago
- French stopwords collection☆98Updated 5 years ago
- Python lemmatizer for Polish.☆19Updated 6 years ago
- An easy to use python package for deep learning-based german sentiment classification.☆58Updated 3 years ago
- ☆13Updated last week
- A curated list of NLP resources for Hungarian☆257Updated 2 months ago
- Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tenso…☆239Updated last year
- MorphoDiTa: Morphologic Dictionary and Tagger☆74Updated 3 weeks ago
- French word embeddings from series sub-titles☆22Updated 7 years ago
- All languages stopwords collection☆458Updated last year
- RoBERTa models for Polish☆88Updated 3 years ago
- A French Lemmatizer in Python based on the LEFFF☆42Updated 5 years ago
- Open German WordNet☆97Updated 2 weeks ago
- NLP framework: sentence detector, tokeniser, pos-tagger and dependency parser☆50Updated 4 months ago
- HerBERT is a BERT-based Language Model trained on Polish Corpora using only MLM objective with dynamic masking of whole words.☆67Updated 3 years ago
- Ready to use Spanish Word2Vec embeddings created from >18B chars and >3B words☆44Updated 6 years ago
- This repo is the home of Romanian Transformers.☆106Updated 3 years ago
- A data set and model for german sentiment classification.☆67Updated 4 months ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆417Updated 8 months ago
- Python Multilingual Ucrel Semantic Analysis System☆31Updated last year
- A collection of over 1.5 Million tweets data translated to French, with their sentiment.☆35Updated 8 years ago
- This repo provides a python module to work with Open Dutch WordNet. It was created using python 3.4.☆67Updated 4 years ago