muatik / genderizer
Genderizer is a language-independent module that tries to detect gender by looking at given first names and/or analyzing sample texts.
☆65Updated 10 years ago
Alternatives and similar repositories for genderizer:
Users who are interested in genderizer are comparing it to the libraries listed below:
- Python library providing sentiment lexicons.☆26Updated 8 years ago
- Hidden alignment conditional random field for classifying string pairs.☆36Updated 7 years ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆152Updated 5 months ago
- A sentiment classifier tool and library trained on Twitter data☆22Updated last year
- Temporal Expression Recognition and Normalisation in Python☆78Updated 9 years ago
- Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The Python script for retrieving ngram…☆253Updated 4 years ago
- Predict age and gender from a first name☆60Updated 6 years ago
- Tool that tries to guess a person's gender based on their name and location☆93Updated 7 months ago
- iPython-based tutorial in Noun Phrase chunking with the NLTK. Written to accompany PyCon 2015 poster presentation.☆17Updated 9 years ago
- Python bindings to the Compact Language Detector☆33Updated 4 years ago
- Supervised learning for novelty detection in text☆78Updated 8 years ago
- Group workspace for improvements to the Columbia Newsblaster system.☆31Updated 8 years ago
- Tokenization and pre-processing for Twitter data used to train classifiers.☆72Updated 8 years ago
- A thin wrapper around the DBPedia Spotlight REST API☆59Updated 10 months ago
- Tools and Libraries for Lexicon-Based Sentiment Analysis☆24Updated 8 years ago
- A simple Python library/tool for pulling location information from unstructured text☆186Updated 14 years ago
- ☆40Updated 9 years ago
- A compound word splitter for Python☆48Updated 3 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆82Updated 8 years ago
- Stability analysis for topic models☆51Updated 8 years ago
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- Turning news into events since 2014.☆51Updated 7 years ago
- Python 2 & 3 wrapper around the Stanford Topic Modeling Toolbox. Intended to be used for hassle-free supervised topic classification with…☆59Updated 7 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 8 years ago
- A fork of boilerpipe with Python 3 support and small fixes, ported from source `https://pypi.python.org/pypi/boilerpipe-py3`.☆45Updated 4 years ago
- wpcorpus - NLP corpus based on Wikipedia's full article dump☆97Updated 9 years ago
- Provide a comprehensive list of tokenizers, features, and general NLP things used for text analysis with examples. The initial focus is o…☆46Updated 9 years ago
- A Python library to calculate the readability score of a text.☆138Updated 7 years ago
- Python wrapper for aspell (C extension and python version)☆81Updated last year