bagrii / address_extraction
Extracting addresses from text
☆41Updated 6 years ago
Alternatives and similar repositories for address_extraction:
Users that are interested in address_extraction are comparing it to the libraries listed below
- This repository contains an implementation of a US address parser built using spaCy NLP library.☆36Updated last year
- Extract dates from text☆64Updated 3 years ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.☆147Updated last week
- Trying to generate name synonyms from wikidata☆33Updated 4 years ago
- Python interface to Apache PDFBox command-line tools.☆75Updated last year
- Using ML to extract campaign finance data from messy forms for journalism☆76Updated 2 years ago
- Python address detector and parser☆204Updated last year
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆60Updated this week
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆56Updated 11 months ago
- A package to structure Australian addresses☆196Updated 2 years ago
- Extract networks of entities from journalistic reporting☆47Updated last year
- Language detection using Spacy and Fasttext☆54Updated last year
- Build a deep learning model for predicting the named entities from text.☆56Updated 6 years ago
- Company Name Processor written in Python☆329Updated 8 months ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆87Updated 4 years ago
- name search for people and entities on the EU, OFAC and UN sanction lists☆24Updated 4 years ago
- Index Common Crawl archives in tabular format☆109Updated 2 months ago
- 🚀GUI for training spaCy models☆54Updated 3 years ago
- A fully customisable language detection pipeline for spaCy☆93Updated 5 years ago
- API for OpenSanctions with support for entity search and bulk matching of data collections. Supports Reconciliation API spec.☆77Updated this week
- Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources☆202Updated this week
- Named Entity Recognition project, which goal is to detect brands from Ebay/Amazon product titles.☆83Updated 7 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆181Updated last year
- A compound word splitter for Python☆48Updated 3 years ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆121Updated 2 weeks ago
- Data Processing and Machine learning methods for the Open Skills Project☆170Updated last month
- How can we improve name matching in screening tools?☆11Updated 3 months ago
- spaCy pipeline component for adding text readability meta data to Doc objects.☆56Updated 5 years ago
- Ultimate Website Sitemap Parser☆190Updated this week