elyase / geotext
Geotext extracts country and city mentions from text
☆139Updated 2 years ago
Alternatives and similar repositories for geotext:
Users that are interested in geotext are comparing it to the libraries listed below
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 8 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆129Updated last year
- Extract countries, regions and cities from a URL or text☆218Updated 4 years ago
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆62Updated last week
- A python client for connecting to all the services provided by https://dandelion.eu☆36Updated last year
- Create a Geonames gazetteer index in Elasticsearch☆76Updated last year
- Guess gender from first name in Python 2 and 3☆133Updated 2 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆190Updated 2 years ago
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- Textpipe: clean and extract metadata from text☆301Updated 3 years ago
- 💫 Scripts, tools and resources for developing spaCy☆126Updated 6 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆181Updated last year
- Named Entity Recognition based on dictionaries☆242Updated 6 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 9 months ago
- A fork of boilerpipe with python 3 and small fixes, ported from source `https://pypi.python.org/pypi/boilerpipe-py3.☆45Updated 5 years ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.☆151Updated 3 months ago
- Extract dates from text☆64Updated 4 years ago
- A fully customisable language detection pipeline for spaCy☆92Updated 6 years ago
- Python bindings to the Compact Language Detector☆33Updated 5 years ago
- A file that contains the schema for GDELT 2.0 Header rows for the Events Database.☆49Updated 6 years ago
- Python package aiding in entity disambiguation based on string and location matching☆18Updated last year
- The Python-language successor to the TABARI event-data coding software.☆45Updated 7 years ago
- Freeling wrapper☆12Updated 8 years ago
- Language independent truecaser in Python.☆160Updated 3 years ago
- ☆52Updated last year
- A lightweight server to allow HTTP requests to the Stanford Named Entity Recognized and a heavily modified CLAVIN geoparser.☆119Updated 2 years ago
- Predict age and gender from a first name☆60Updated 6 years ago
- Python script for matching a list of messy addresses against a gazetteer using dedupe.☆63Updated 5 years ago
- Python 2 & 3 wrapper around the Stanford Topic Modeling Toolbox. Intended to be used for hassle-free supervised topic classification with…☆58Updated 7 years ago
- Quickly extract multi-word phrases from a corpus☆191Updated 4 years ago