madisonmay / CommonRegex
A collection of common regular expressions bundled with an easy to use interface.
β1,574Updated last year
Alternatives and similar repositories for CommonRegex:
Users that are interested in CommonRegex are comparing it to the libraries listed below
- A simple Python module for parsing human names into their individual componentsβ671Updated 10 months ago
- πͺΌ a python library for doing approximate and phonetic matching of strings.β2,119Updated last week
- Port of Google's language-detection library to Python.β1,783Updated last month
- A toolkit for making domain-specific probabilistic parsersβ800Updated 6 months ago
- Python address detector and parserβ208Updated last year
- a python library for parsing unstructured western names into name components.β604Updated 5 months ago
- Heuristic based boilerplate removal toolβ765Updated last month
- a python library for parsing unstructured United States address strings into address componentsβ1,559Updated 3 weeks ago
- Accurately generate all possible forms of an English word e.g "election" --> "elect", "electoral", "electorate" etc.β630Updated 3 years ago
- extract text from any document. no muss. no fuss.β4,062Updated 4 months ago
- python humanize functionsβ1,680Updated 2 years ago
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to wordsβ1,014Updated last month
- Python bindings to libpostal for fast international address parsing/normalizationβ803Updated 2 months ago
- π Twisted Deferred Thread backend for Requests.β417Updated 5 years ago
- A port of Ruby on Rails' inflector to Pythonβ510Updated last year
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.β374Updated 2 years ago
- The simplest way to extract text from PDFs in Pythonβ427Updated 2 years ago
- Python module (C extension and plain python) implementing Aho-Corasick algorithmβ991Updated last year
- Converts XML to Python objectsβ620Updated last year
- Web Content Retrieval for Humansβ’β620Updated 2 years ago
- Extracts the top level domain (TLD) from the URL given.β182Updated last year
- Fast multi-keyword search engine for text stringsβ252Updated 7 months ago
- Summarizes news articlesβ1,169Updated 3 years ago
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectorsβ1,216Updated 2 weeks ago
- Clean personally identifiable information from dirty dirty text.β405Updated last year
- CONTRIBUTIONS ONLY: Voluptuous, despite the name, is a Python data validation library.β1,832Updated 8 months ago
- Just the facts -- web page content extractionβ1,260Updated 9 months ago
- A toolbelt of useful classes and functions to be used with python-requestsβ1,008Updated 3 months ago
- ASCII transliterations of Unicode text - GitHub mirrorβ559Updated 11 months ago
- Magic decorator syntax for asynchronous code in Pythonβ1,459Updated 5 years ago