bieli / stopwordsLinks
Popular stopwords for general languages - very usefull for building dictionaries, searchers or text indexes
☆44Updated 12 years ago
Alternatives and similar repositories for stopwords
Users that are interested in stopwords are comparing it to the libraries listed below
Sorting:
- A curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models, tools, datasets.☆307Updated 4 years ago
- Pre-trained models and language resources for Natural Language Processing in Polish☆368Updated last year
- Polish Dataset of Banned Harmful and Offensive Content from Wykop.pl web service☆59Updated last year
- ☆30Updated 3 years ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆515Updated last year
- All languages stopwords collection☆476Updated 2 years ago
- An easy to use python package for deep learning-based german sentiment classification.☆58Updated 3 years ago
- HuSpaCy: industrial-strength Hungarian natural language processing☆174Updated 2 months ago
- RoBERTa models for Polish☆90Updated 3 years ago
- Python lemmatizer for Polish.☆19Updated 6 years ago
- Arabic nested named entity recognition☆45Updated 11 months ago
- BERT for Arabic Topic Modeling: An Experimental Study on BERTopic Technique☆28Updated 4 years ago
- How good is BERT ? Comparing BERT to other state-of-the-art approaches on a French sentiment analysis dataset☆157Updated 2 years ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/☆767Updated last week
- Unofficial Python library for using the Polish Wordnet (plWordNet / Słowosieć)☆20Updated 3 years ago
- Open German WordNet☆100Updated last month
- CLASSLA Fork of the Official Stanford NLP Python Library for Many Human Languages☆46Updated 9 months ago
- HerBERT is a BERT-based Language Model trained on Polish Corpora using only MLM objective with dynamic masking of whole words.☆70Updated 4 years ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆418Updated last year
- The most accurate natural language detection library for Python, suitable for short text and mixed-language text☆1,630Updated 2 months ago
- Train Spacy ner with custom dataset☆182Updated 3 years ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪☆79Updated 4 years ago
- Context-Sensitive Neural Spelling Checker☆20Updated last year
- 🧹 Python package for text cleaning☆1,001Updated 2 weeks ago
- How to train Word2Vec for your language.☆10Updated 8 years ago
- Resources for doing NLP in Polish☆48Updated 6 years ago
- ☆51Updated 3 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆856Updated 2 weeks ago
- 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.☆900Updated last year
- Spelling corrector in python☆492Updated 7 months ago