openeventdata / mordecai
Full text geoparsing as a Python library
☆747Updated 3 years ago
Alternatives and similar repositories for mordecai:
Users that are interested in mordecai are comparing it to the libraries listed below
- Python bindings to libpostal for fast international address parsing/normalization☆803Updated 2 months ago
- a python library for parsing unstructured western names into name components.☆604Updated 5 months ago
- Geotext extracts country and city mentions from text☆139Updated 2 years ago
- Create a Geonames gazetteer index in Elasticsearch☆76Updated last year
- geoparsepy is a Python geoparsing library that will extract and disambiguate locations from text. It uses a local OpenStreetMap database …☆62Updated 3 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆181Updated last year
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆190Updated last year
- Textpipe: clean and extract metadata from text☆302Updated 3 years ago
- Various Algorithms for Short Text Mining☆470Updated last week
- Extract countries, regions and cities from a URL or text☆218Updated 4 years ago
- NLP, before and after spaCy☆2,222Updated last year
- A knowledge base construction engine for richly formatted data☆409Updated 3 years ago
- Examples for using the dedupe library☆411Updated 8 months ago
- Fast word vectors with little memory usage in Python☆417Updated 3 years ago
- semi supervised guided topic model with custom guidedLDA☆505Updated this week
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆997Updated last year
- a Deep Learning Framework for Text https://delft.readthedocs.io/☆398Updated last month
- A lightweight server to allow HTTP requests to the Stanford Named Entity Recognized and a heavily modified CLAVIN geoparser.☆119Updated 2 years ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆731Updated 8 months ago
- Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning☆310Updated 2 months ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆255Updated 7 months ago
- Quickly extract multi-word phrases from a corpus☆191Updated 4 years ago
- Python address detector and parser☆208Updated last year
- Information extraction from English and German texts based on predicate logic☆390Updated 2 years ago
- A multilingual, cross-domain temporal tagger developed at the Database Systems Research Group at Heidelberg University.☆343Updated 2 years ago
- Find dates inside text using Python and get back datetime objects☆650Updated 11 months ago
- 💫 REST microservices for various spaCy-related tasks☆240Updated 2 years ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction☆2,175Updated 9 months ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,379Updated 2 months ago
- A spaCy pipeline and model for NLP on unstructured legal text.☆648Updated 9 months ago