vladimarius / pyap
Python address detector and parser
☆208Updated last year
Alternatives and similar repositories for pyap:
Users that are interested in pyap are comparing it to the libraries listed below
- pyaddress is an address parsing library, taking the guesswork out of using addresses in your applications. We use it as part of our apart…☆100Updated 5 years ago
- Python bindings to libpostal for fast international address parsing/normalization☆804Updated 2 months ago
- a python library for parsing unstructured western names into name components.☆604Updated 5 months ago
- Clean US addresses following USPS pub 28 and RESO guidelines☆214Updated last year
- Parse, normalize and render postal addresses.☆184Updated last year
- Modern robots.txt Parser for Python☆194Updated last year
- A package to structure Australian addresses☆198Updated 2 years ago
- Extract countries, regions and cities from a URL or text☆218Updated 4 years ago
- Automatically extracts and normalizes an online article or blog post publication date☆117Updated last year
- remove signature blocks from emails☆86Updated 5 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆150Updated 3 months ago
- Find dates inside text using Python and get back datetime objects☆651Updated 11 months ago
- Clean personally identifiable information from dirty dirty text.☆407Updated last year
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆137Updated 9 months ago
- A toolkit for making domain-specific probabilistic parsers☆800Updated 6 months ago
- geonamescache - a Python library for quick access to a subset of GeoNames data.☆107Updated 8 months ago
- a python library for parsing unstructured United States address strings into address components☆1,559Updated this week
- Abydos NLP/IR library for Python☆185Updated 2 years ago
- Street address parser and formatter☆91Updated 5 years ago
- Company Name Processor written in Python☆336Updated 11 months ago
- Examples for using the dedupe library☆411Updated 8 months ago
- NER toolkit for HTML data☆259Updated 11 months ago
- Library for unit extraction - fork of quantulum for python3☆138Updated 9 months ago
- A Python library for extracting titles, images, descriptions and canonical urls from HTML.☆149Updated 4 years ago
- Python script for matching a list of messy addresses against a gazetteer using dedupe.☆63Updated 5 years ago
- A simple Python module for parsing human names into their individual components☆671Updated 10 months ago
- Email reply parser library for Python☆501Updated 8 months ago
- Extract text from HTML☆135Updated 4 years ago
- A simple fuzzy matching set for python strings☆226Updated 8 months ago
- Command line tool for deduplicating CSV files☆420Updated 5 years ago