psolin / cleanco
Company Name Processor written in Python
☆336Updated 10 months ago
Alternatives and similar repositories for cleanco:
Users that are interested in cleanco are comparing it to the libraries listed below
- Super Fast String Matching in Python☆367Updated 3 weeks ago
- Name matching is a Python package for the matching of company names. This package has been developed to match the names of companies from…☆149Updated this week
- Simplifies use of the Dedupe library via Pandas☆135Updated 2 years ago
- Python address detector and parser☆208Updated last year
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆283Updated 2 years ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆997Updated last year
- Examples for using the dedupe library☆410Updated 8 months ago
- Ultimate Website Sitemap Parser☆200Updated last week
- Clean US addresses following USPS pub 28 and RESO guidelines☆214Updated last year
- Extract price amount and currency symbol from a raw text string☆324Updated 2 months ago
- 📛 Fuzzy Name Matching with Machine Learning☆264Updated 9 months ago
- a python library for parsing unstructured western names into name components.☆604Updated 5 months ago
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets.☆71Updated last month
- Geotext extracts country and city mentions from text☆139Updated 2 years ago
- Command line tool for deduplicating CSV files☆420Updated 5 years ago
- Fuzzy string matching, grouping, and evaluation.☆758Updated last month
- Python bindings to libpostal for fast international address parsing/normalization☆803Updated 2 months ago
- Fuzzy matching and more functionality for spaCy.☆256Updated 9 months ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆128Updated last year
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.☆149Updated 2 months ago
- A list of free data matching and record linkage software.☆378Updated last year
- Resources for tackling record linkage / deduplication / data matching problems☆122Updated last year
- A simple Python module for parsing human names into their individual components☆671Updated 10 months ago
- Extract text from HTML☆135Updated 4 years ago
- Using Natural Language Processing to standardize Company Names☆12Updated 3 years ago
- Group thousands of similar spreadsheet or database text entries in seconds☆155Updated last year
- Find dates inside text using Python and get back datetime objects☆650Updated 11 months ago
- Clean personally identifiable information from dirty dirty text.☆405Updated last year
- 🧹 Python package for text cleaning☆975Updated last year
- Fast and robust date extraction from web pages, with Python or on the command-line☆124Updated 3 months ago