elyase / geotext
Geotext extracts country and city mentions from text
☆138Updated 2 years ago
Alternatives and similar repositories for geotext:
Users interested in geotext are comparing it to the libraries listed below.
- Extract countries, regions and cities from a URL or text☆218Updated 4 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 8 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆127Updated 11 months ago
- Textpipe: clean and extract metadata from text☆302Updated 3 years ago
- Guess gender from first name in Python 2 and 3☆133Updated 2 years ago
- Extract city and country mentions from text like GeoText, but using FlashText, an Aho-Corasick implementation, instead of regex.☆62Updated last week
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆181Updated last year
- Full text geoparsing as a Python library☆746Updated 3 years ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆246Updated 10 months ago
- CoCrawler is a versatile web crawler built using modern tools and concurrency.☆190Updated 2 years ago
- Python script for matching a list of messy addresses against a gazetteer using dedupe.☆62Updated 4 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆148Updated 2 months ago
- Extract dates from text☆64Updated 4 years ago
- Named Entity Recognition based on dictionaries☆242Updated 6 years ago
- Get list of common stop words in various languages in Python☆155Updated last year
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆190Updated last year
- Python bindings to the Compact Language Detector☆33Updated 4 years ago
- Python package aiding in entity disambiguation based on string and location matching☆18Updated last year
- A compound word splitter for Python☆48Updated 3 years ago
- Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG.☆105Updated 2 years ago
- A Python library for parsing unstructured Western names into name components.☆604Updated 4 months ago
- Hunspell extension for spaCy 2.0.☆94Updated 7 months ago
- pyaddress is an address parsing library, taking the guesswork out of using addresses in your applications. We use it as part of our apart…☆100Updated 5 years ago
- Quickly extract multi-word phrases from a corpus☆191Updated 4 years ago
- A fully customisable language detection pipeline for spaCy☆92Updated 5 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- Extract text from HTML☆134Updated 4 years ago