Xangis / extra-stopwordsLinks
Extra stopword lists for use with NLTK.
☆30Updated last year
Alternatives and similar repositories for extra-stopwords
Users that are interested in extra-stopwords are comparing it to the libraries listed below
Sorting:
- A fully customisable language detection pipeline for spaCy☆93Updated 6 years ago
- Guess gender from first name in Python 2 and 3☆135Updated last month
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.☆77Updated 3 years ago
- Language detection extension for spaCy 2.0+☆113Updated 6 years ago
- Python wrapper for Stanford CoreNLP's SUTime☆154Updated 2 years ago
- Get list of common stop words in various languages in Python☆156Updated last year
- Lightning Fast Language Prediction 🚀☆167Updated 6 years ago
- A suite of tools for collecting, pre-processing, analyzing and sentiment-scoring twitter data☆22Updated 4 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 10 months ago
- Extract dates from text☆64Updated 4 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆181Updated 2 years ago
- Cython wrapper on Hunspell Dictionary☆66Updated last year
- Webz.io Python SDK☆43Updated 3 years ago
- 📝Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible wi…☆63Updated last year
- Projects☆21Updated 8 years ago
- [Project INVALID not supported anymore]☆37Updated 5 years ago
- Build a deep learning model for predicting the named entities from text.☆56Updated 6 years ago
- A Python package for gender classification.☆86Updated 2 years ago
- Running Prodigy for a team of annotators☆53Updated 4 years ago
- French language support for TextBlob.☆59Updated 4 years ago
- Python wrapper for aspell (C extension and python version)☆82Updated 2 years ago
- Package that returns a company embedding given a company name☆46Updated 5 years ago
- N-gram Extraction Approaches (bigrams, trigrams)☆44Updated 6 years ago
- Docker images for production NLP usage including deep learning☆35Updated 6 years ago
- Use ML-Annotate to label data for machine learning purposes☆109Updated 4 years ago
- Extension of scikit-learn TfidfVectorizer and CountVectorizer that allows for online learning / partial fit.☆34Updated 7 years ago
- A compound word splitter for Python☆48Updated 3 years ago
- ☆171Updated 3 months ago
- Time everything in IPython☆124Updated last year
- Anonymization of legal cases (Fr) based on Flair embeddings☆88Updated 4 years ago