iwpnd / flashgeotextLinks
Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.
β62Updated this week
Alternatives and similar repositories for flashgeotext
Users that are interested in flashgeotext are comparing it to the libraries listed below
Sorting:
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.β129Updated last year
- 𧬠A VS Code extension for annotating data with Prodigyβ30Updated 3 years ago
- An open-source package for python to clean raw text dataβ70Updated last year
- Scalable String Similarity Joins in Pythonβ39Updated 10 months ago
- data wrangling simplicity, complete audit transparency, and at speedβ34Updated 2 months ago
- A maximum-strength name parser for record linkage.β37Updated 3 weeks ago
- Python package for deduplication/entity resolution using active learningβ80Updated 9 months ago
- Extract networks of entities from journalistic reportingβ48Updated last year
- Language detection using Spacy and Fasttextβ55Updated last year
- Generate reports for spaCy models.β29Updated 3 years ago
- A browser user interface for manual labeling of record pairs.β47Updated last year
- Dataframe Integration with spaCy.β102Updated 4 years ago
- Library for unit extraction - fork of quantulum for python3β140Updated 11 months ago
- β30Updated 2 years ago
- β55Updated last year
- An End-to-End Evaluation Framework for Entity Resolution Systemsβ28Updated last year
- Easy PDF to text to spaCy text extraction in Python.β39Updated 7 months ago
- A Python implementation of Lunr.js πβ195Updated 2 months ago
- It's a cooler way to store simple linear models.β28Updated 10 months ago
- A simple converter from SpaCy Entities (Spans) to Huggingface BILOU formatted data (tokens and ner_tags)β14Updated 8 months ago
- Create a Geonames gazetteer index in Elasticsearchβ76Updated last year
- Annotation Management for Prodigy, that support multiple users working in many projectsβ15Updated 6 years ago
- Collection of code snippets and utilities for streamlit appsβ22Updated 5 years ago
- A Flexible Deep Learning Approach to Fuzzy String Matchingβ145Updated 7 months ago
- Provide partial dates and retain the date precision through processingβ13Updated 2 years ago
- Information extraction from English and German texts based on predicate logicβ136Updated last year
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidataβ161Updated 2 years ago
- Python wrapper for a C++ Double Metaphoneβ15Updated 3 weeks ago
- β69Updated 3 years ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the sameβ¦β29Updated 2 years ago