openstates / name_tools
DEPRECATED - name_tools for Open States and other projects
☆19 Updated 5 years ago
Alternatives and similar repositories for name_tools
Users who are interested in name_tools are comparing it to the libraries listed below.
- A tiny library for Python text normalisation. Useful for ad-hoc text processing. ☆157 Updated 4 months ago
- Python's missing statistical Swiss Army knife ☆15 Updated 10 years ago
- An attempt at creating a gold standard dataset for backtesting yesterday & today's content-extractors ☆35 Updated 10 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm: "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived'] ☆81 Updated 9 years ago
- Twitter text processing library (auto linking and extraction of usernames, lists and hashtags). ☆178 Updated last year
- Unicode transliteration in Python (clone of Tomaž Šolc's repository at zemanta.com) ☆114 Updated 10 years ago
- A simple fuzzy matching set for Python strings ☆230 Updated last year
- Automatically extracts and normalizes an online article or blog post publication date ☆118 Updated 2 years ago
- This is a clone of the Python duplicate code detection tool from http://sourceforge.net/projects/clonedigger/ ☆32 Updated 10 years ago
- A decorator-based implementation of type checks. ☆145 Updated 5 years ago
- Python library for extracting HTML microdata ☆167 Updated 2 years ago
- Python package for Google's diff-match-patch native C++ implementation. ☆87 Updated last year
- Library for guessing a person's gender by their first name. ☆58 Updated 8 years ago
- Python implementation of the Parsley language for extracting structured data from web pages ☆92 Updated 8 years ago
- Import tables from any Wikipedia article as a dataset in Python ☆293 Updated 4 years ago
- Text normalization library for Python ☆202 Updated 7 years ago
- Pretty HTML/XML rendering with syntax highlighting for BeautifulSoup objects in IPython notebook and qtconsole. ☆70 Updated 5 years ago
- Makes it easy to respect rate limits. ☆96 Updated 9 years ago
- Modularly extensible semantic metadata validator ☆84 Updated 10 years ago
- A module for querying the DOM tree and writing XPath expressions using native Python syntax. ☆127 Updated 7 years ago
- A pure Python Levenshtein implementation that's not freaking GPL'd. ☆97 Updated 2 years ago
- A Python parser for data that only looks like JSON ☆65 Updated 2 years ago
- Snowball stemming library collection for Python ☆121 Updated 6 years ago
- A Python library for extracting semantic information from text, such as dates and numbers. ☆80 Updated 3 years ago
- Sunburnt offspring Solr client ☆27 Updated 3 years ago
- Memory-based shallow parser for Python ☆74 Updated 6 years ago
- MediaWiki parser library ☆105 Updated last week
- Re-usable wrapper scripts for text document extractors. ☆37 Updated 9 years ago
- Python library with common functionality for writing web scrapers ☆102 Updated 10 years ago
- ⛏ a library for scraping unreliable pages ☆212 Updated last month