psolin / cleanco
Company Name Processor written in Python
☆348 · Updated last week
Alternatives and similar repositories for cleanco
Users that are interested in cleanco are comparing it to the libraries listed below
- Name matching is a Python package for the matching of company names. This package has been developed to match the names of companies from… ☆161 · Updated 2 weeks ago
- Super Fast String Matching in Python ☆372 · Updated 8 months ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4 ☆286 · Updated 3 years ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python ☆1,032 · Updated last year
- a python library for parsing unstructured western names into name components. ☆614 · Updated 6 months ago
- Simplifies use of the Dedupe library via Pandas ☆136 · Updated 2 years ago
- Find dates inside text using Python and get back datetime objects ☆665 · Updated last year
- 📛 Fuzzy Name Matching with Machine Learning ☆267 · Updated last year
- A list of free data matching and record linkage software. ☆394 · Updated last year
- Examples for using the dedupe library ☆415 · Updated last year
- Clean US addresses following USPS pub 28 and RESO guidelines ☆229 · Updated last year
- Fuzzy string matching, grouping, and evaluation. ☆784 · Updated 4 months ago
- Ultimate Website Sitemap Parser ☆234 · Updated last month
- A simple Python module for parsing human names into their individual components ☆697 · Updated last year
- Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning ☆325 · Updated last month
- Textpipe: clean and extract metadata from text ☆302 · Updated 4 years ago
- Full text geoparsing as a Python library ☆753 · Updated 4 years ago
- Fuzzy matching and more functionality for spaCy. ☆259 · Updated last year
- Postal code geocoding and distance calculation ☆255 · Updated last month
- Extract price amount and currency symbol from a raw text string ☆342 · Updated last month
- Python bindings to libpostal for fast international address parsing/normalization ☆858 · Updated last month
- Clean personally identifiable information from dirty dirty text. ☆416 · Updated 2 years ago
- Python package to accelerate the sparse matrix multiplication and top-n similarity selection ☆416 · Updated last week
- Extract embedded metadata from HTML markup ☆934 · Updated 2 months ago
- Library for unit extraction - fork of quantulum for python3 ☆145 · Updated last year
- Group thousands of similar spreadsheet or database text entries in seconds ☆157 · Updated 2 years ago
- LexNLP by LexPredict ☆754 · Updated last year
- demo using FuzzyWuzzy matching company names ☆75 · Updated 3 years ago
- Open Source Thesaurus of Job Titles in US English ☆140 · Updated 3 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power ☆191 · Updated 2 years ago