openeventdata / mordecai
Full text geoparsing as a Python library
☆749Updated 3 years ago
Alternatives and similar repositories for mordecai:
Users that are interested in mordecai are comparing it to the libraries listed below
- Create a Geonames gazetteer index in Elasticsearch☆76Updated last year
- geoparsepy is a Python geoparsing library that will extract and disambiguate locations from text. It uses a local OpenStreetMap database …☆62Updated 3 years ago
- NLP, before and after spaCy☆2,225Updated last year
- Extract countries, regions and cities from a URL or text☆218Updated 4 years ago
- Geotext extracts country and city mentions from text☆139Updated 2 years ago
- a python library for parsing unstructured western names into name components.☆606Updated 6 months ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆129Updated last year
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆731Updated 8 months ago
- 💫 Jupyter notebooks for spaCy examples and tutorials☆288Updated 6 years ago
- 🦆 Contextually-keyed word vectors☆1,650Updated 2 weeks ago
- Fast word vectors with little memory usage in Python☆417Updated 3 years ago
- A toolkit for making domain-specific probabilistic parsers☆801Updated 7 months ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,383Updated 3 months ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆181Updated 2 years ago
- A spaCy pipeline and model for NLP on unstructured legal text.☆649Updated 9 months ago
- Company Name Processor written in Python☆338Updated 11 months ago
- Textpipe: clean and extract metadata from text☆301Updated 3 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆190Updated 2 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…☆824Updated 2 weeks ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction☆2,174Updated 9 months ago
- Named Entity Recognition data for Europeana Newspapers☆171Updated 2 years ago
- A lightweight server to allow HTTP requests to the Stanford Named Entity Recognized and a heavily modified CLAVIN geoparser.☆119Updated 2 years ago
- A machine learning tool for fishing entities☆264Updated last month
- Smarter Manual Annotation for Resource-constrained collection of Training data☆227Updated 5 months ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆922Updated 8 months ago
- semi supervised guided topic model with custom guidedLDA☆507Updated 3 weeks ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆398Updated 3 years ago
- Named Entity Recognition based on dictionaries☆242Updated 6 years ago
- Quickly extract multi-word phrases from a corpus☆191Updated 4 years ago
- NER toolkit for HTML data☆259Updated last year