Xangis / extra-stopwordsLinks
Extra stopword lists for use with NLTK.
☆30Updated 2 years ago
Alternatives and similar repositories for extra-stopwords
Users that are interested in extra-stopwords are comparing it to the libraries listed below
Sorting:
- Get list of common stop words in various languages in Python☆159Updated last month
- A fully customisable language detection pipeline for spaCy☆93Updated 6 years ago
- Elegant and Easy Tweet Preprocessing in Python☆309Updated 2 years ago
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python☆272Updated 2 years ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.☆79Updated 4 years ago
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine☆190Updated 4 years ago
- ☆176Updated 9 months ago
- Open source Emoticons and Emoji detection library: emot☆195Updated 2 years ago
- Guess gender from first name in Python 2 and 3☆138Updated 7 months ago
- French language support for TextBlob.☆60Updated 5 years ago
- ☆129Updated 4 years ago
- Convert number words (eg. twenty one) to numeric digits (21)☆180Updated 2 years ago
- Simple, Pythonic extraction of text, shapes and images from PDFs☆80Updated 5 years ago
- Extension of scikit-learn TfidfVectorizer and CountVectorizer that allows for online learning / partial fit.☆34Updated 8 years ago
- Python interface to the LinkedIn API - V2☆57Updated 4 years ago
- Genderizer is a language independent module which tries to detect gender by looking given first names and/or analyzing sample texts.☆64Updated 11 years ago
- Python stemming library using snowball stemmers☆275Updated 2 weeks ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆183Updated 2 years ago
- Sentiment Classification using Word Sense Disambiguation☆170Updated 3 years ago
- A fork of boilerpipe with python 3 and small fixes, ported from source `https://pypi.python.org/pypi/boilerpipe-py3.☆45Updated 5 years ago
- Detect Language API Python Client☆71Updated 4 months ago
- N-gram Extraction Approaches (bigrams, trigrams)☆43Updated 7 years ago
- Extract countries, regions and cities from a URL or text☆217Updated 5 years ago
- Hunspell extension for spaCy 2.0.☆94Updated last year
- A universal Python library for detecting and filtering profanity☆80Updated last year
- Textpipe: clean and extract metadata from text☆302Updated 4 years ago
- Fixes contractions such as `you're` to `you are`☆318Updated 3 years ago
- spellchecking library for python☆615Updated 3 months ago
- Find dates inside text using Python and get back datetime objects☆665Updated last year
- Python address detector and parser☆213Updated 2 years ago