psolin / cleanco
Company Name Processor written in Python
☆333 Updated 9 months ago
Alternatives and similar repositories for cleanco:
Users who are interested in cleanco are comparing it to the libraries listed below.
- Name matching is a Python package for the matching of company names. This package has been developed to match the names of companies from… ☆144 Updated this week
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4 ☆282 Updated 2 years ago
- Super Fast String Matching in Python ☆364 Updated 2 weeks ago
- Simplifies use of the Dedupe library via Pandas ☆135 Updated last year
- A Python library for parsing unstructured Western names into name components. ☆599 Updated 3 months ago
- Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning ☆309 Updated this week
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation. ☆148 Updated 3 weeks ago
- Fuzzy string matching, grouping, and evaluation. ☆751 Updated this week
- 📛 Fuzzy Name Matching with Machine Learning ☆262 Updated 8 months ago
- Abydos NLP/IR library for Python ☆184 Updated 2 years ago
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets. ☆68 Updated 3 weeks ago
- Examples for using the dedupe library ☆409 Updated 6 months ago
- Python address detector and parser ☆206 Updated last year
- A list of free data matching and record linkage software. ☆376 Updated last year
- Group thousands of similar spreadsheet or database text entries in seconds ☆156 Updated last year
- A comprehensive and scalable set of string tokenizers and similarity measures in Python ☆136 Updated 7 months ago
- A simple Python module for parsing human names into their individual components ☆668 Updated 8 months ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing. ☆147 Updated last month
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power ☆190 Updated last year
- Fast, flexible name matching for large datasets ☆70 Updated last year
- Using Natural Language Processing to standardize Company Names ☆12 Updated 3 years ago
- Python package to accelerate the sparse matrix multiplication and top-n similarity selection ☆399 Updated 2 months ago
- Find dates inside text using Python and get back datetime objects ☆641 Updated 9 months ago
- Package that returns a company embedding given a company name ☆44 Updated 4 years ago
- Fuzzy matching and more functionality for spaCy. ☆254 Updated 7 months ago
- Name matching algorithm for company and people names in English ☆13 Updated last year
- Clean personally identifiable information from dirty dirty text. ☆403 Updated last year
- A powerful and modular toolkit for record linkage and duplicate detection in Python ☆982 Updated last year
- Extract city and country mentions from text, like GeoText, but using FlashText (an Aho-Corasick implementation) instead of regex. ☆60 Updated this week
- A Flexible Deep Learning Approach to Fuzzy String Matching ☆141 Updated 4 months ago