vladimarius / pyap
Python address detector and parser
☆212 Updated last year
Alternatives and similar repositories for pyap
Users who are interested in pyap are comparing it to the libraries listed below.
- Find dates inside text using Python and get back datetime objects ☆662 Updated last year
- pyaddress is an address parsing library, taking the guesswork out of using addresses in your applications. We use it as part of our apart… ☆100 Updated 5 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing. ☆154 Updated last month
- A simple Python module for parsing human names into their individual components ☆686 Updated last year
- Python bindings to libpostal for fast international address parsing/normalization ☆842 Updated 7 months ago
- remove signature blocks from emails ☆86 Updated 6 years ago
- Parse, normalize and render postal addresses. ☆185 Updated last year
- A Python library for extracting titles, images, descriptions and canonical urls from HTML. ☆151 Updated 5 years ago
- Textpipe: clean and extract metadata from text ☆302 Updated 4 years ago
- A simple fuzzy matching set for python strings ☆229 Updated last year
- Modern robots.txt Parser for Python ☆196 Updated last year
- Company Name Processor written in Python ☆342 Updated last year
- Automatically extracts and normalizes an online article or blog post publication date ☆117 Updated 2 years ago
- Extract countries, regions and cities from a URL or text ☆217 Updated 5 years ago
- Street address parser and formatter ☆91 Updated 6 years ago
- Extract price amount and currency symbol from a raw text string ☆337 Updated 6 months ago
- NER toolkit for HTML data ☆259 Updated last year
- Ultimate Website Sitemap Parser ☆226 Updated 2 weeks ago
- python library for extracting html microdata ☆167 Updated 2 years ago
- Extract text from HTML ☆134 Updated 5 years ago
- Email reply parser library for Python ☆508 Updated last year
- Clean US addresses following USPS pub 28 and RESO guidelines ☆227 Updated last year
- Full text geoparsing as a Python library ☆752 Updated 3 years ago
- Clean personally identifiable information from dirty dirty text. ☆414 Updated 2 years ago
- A Scrapy extension to log items coverage when the spider shuts down ☆19 Updated 5 years ago
- Library for unit extraction - fork of quantulum for python3 ☆142 Updated last year
- geonamescache - a Python library for quick access to a subset of GeoNames data. ☆114 Updated last week
- Get list of common stop words in various languages in Python ☆156 Updated last year
- A collection of common regular expressions bundled with an easy to use interface. ☆1,578 Updated 2 years ago
- Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py ☆390 Updated 2 years ago