Xangis / extra-stopwords
Extra stopword lists for use with NLTK.
☆29Updated last year
Alternatives and similar repositories for extra-stopwords:
Users that are interested in extra-stopwords are comparing it to the libraries listed below
- A fully customisable language detection pipeline for spaCy☆92Updated 5 years ago
- Guess gender from first name in Python 2 and 3☆133Updated 2 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 8 months ago
- Get list of common stop words in various languages in Python☆155Updated last year
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- ☆168Updated this week
- Language Models for Zalando's flair library☆61Updated 5 years ago
- A TextBlob sentiment analysis pipeline component for spaCy.☆56Updated 5 months ago
- 📝Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible wi…☆63Updated last year
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.☆77Updated 3 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆181Updated last year
- AsyncIO serving for data science models☆24Updated 2 years ago
- Language detection using Spacy and Fasttext☆55Updated last year
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆169Updated 3 years ago
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python☆268Updated last year
- (Official repo for pypi package) Python bindings for the Hunspell spellchecker engine☆186Updated 4 years ago
- Extract dates from text☆64Updated 4 years ago
- Library for unit extraction - fork of quantulum for python3☆137Updated 9 months ago
- Open source Emoticons and Emoji detection library: emot☆192Updated last year
- A Python implementation of the Metaphone and Double Metaphone algorithms☆81Updated last year
- A compound word splitter for Python☆48Updated 3 years ago
- Ensemble topic modeling with matrix factorization☆25Updated 6 years ago
- Babel Street Analytics Client Library for Python☆38Updated 2 weeks ago
- A Python library for extracting titles, images, descriptions and canonical urls from HTML.☆149Updated 4 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆149Updated 2 months ago
- 💫 Scripts, tools and resources for developing spaCy☆125Updated 6 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Updated 3 years ago
- Python wrapper for aspell (C extension and python version)☆81Updated last year
- spaCy + UDPipe☆161Updated 2 years ago
- Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG.☆105Updated 2 years ago