somnathrakshit / geograpy3
Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.
☆126Updated 10 months ago
Alternatives and similar repositories for geograpy3:
Users that are interested in geograpy3 are comparing it to the libraries listed below
- Extract city and country mentions from text, like GeoText, but using FlashText (an Aho-Corasick implementation) instead of regex.☆60Updated this week
- Geotext extracts country and city mentions from text☆138Updated 2 years ago
- geoparsepy is a Python geoparsing library that will extract and disambiguate locations from text. It uses a local OpenStreetMap database …☆62Updated 3 years ago
- Fuzzy matching and more functionality for spaCy.☆254Updated 7 months ago
- Create a Geonames gazetteer index in Elasticsearch☆74Updated last year
- Dataframe Integration with spaCy.☆103Updated 3 years ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆122Updated last month
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆157Updated 2 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆190Updated last year
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆93Updated last year
- An open-source Python package for cleaning raw text data☆69Updated last year
- Full text geoparsing/toponym resolution with event geolocation☆72Updated last week
- Find strings/words in text; convenience and C speed☆126Updated 2 years ago
- Extract text from HTML☆133Updated 4 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 6 months ago
- 🧪 Cutting-edge experimental spaCy components and features☆96Updated 9 months ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆141Updated 4 months ago
- ☆168Updated 8 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆150Updated last year
- A TextBlob sentiment analysis pipeline component for spaCy.☆56Updated 4 months ago
- A fully customisable language detection pipeline for spaCy☆92Updated 5 years ago
- Extract dates from text☆64Updated 4 years ago
- Extract networks of entities from journalistic reporting☆48Updated last year
- A Python tool to pull the complete edit history of a Wikipedia page☆20Updated 2 months ago
- A browser user interface for manual labeling of record pairs.☆44Updated last year
- Tag news stories based on models trained on the NYT corpus.☆42Updated last year
- ☆68Updated 2 years ago
- Group thousands of similar spreadsheet or database text entries in seconds☆156Updated last year
- Library for unit extraction - fork of quantulum for python3☆136Updated 7 months ago