ing-bank / EntityMatchingModel
Entity Matching Model solves the problem of matching company names between two possibly very large datasets.
☆71Updated last month
Alternatives and similar repositories for EntityMatchingModel:
Users that are interested in EntityMatchingModel are comparing it to the libraries listed below
- Name matching is a Python package for the matching of company names. This package has been developed to match the names of companies from…☆149Updated this week
- Fast, flexible name matching for large datasets☆71Updated last year
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning☆118Updated last week
- A tutorial on entity resolution (record linkage or de-duplication)☆62Updated 4 years ago
- demo using FuzzyWuzzy matching company names☆74Updated 3 years ago
- An End-to-End Evaluation Framework for Entity Resolution Systems☆27Updated last year
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆111Updated 4 months ago
- Google Trends, made easy.☆104Updated 10 months ago
- A browser user interface for manual labeling of record pairs.☆46Updated last year
- List of entity resolution software and resources.☆63Updated last month
- A Python client for the GDELT 2.0 Doc API☆127Updated last week
- ☆31Updated this week
- Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.☆73Updated 9 months ago
- Record matching and entity resolution at scale in Spark☆34Updated last year
- Python package for text mining of time-series data☆71Updated 4 months ago
- This project is wraper for Leilex, legal entity identifier API. Includes ISIN-LEI conversion. Search LEI number using company name.☆24Updated 6 months ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆144Updated 5 months ago
- Using Natural Language Processing to standardize Company Names☆12Updated 3 years ago
- Package that returns a company embedding given a company name☆45Updated 4 years ago
- Nesta's Skills Extractor Library☆129Updated 5 months ago
- Innovation across ages☆69Updated 2 years ago
- Interactive notebooks containing demonstration code of the splink library☆38Updated last year
- https://github.com/jcgcarranza/respol_patents_code☆32Updated 4 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality☆19Updated 6 years ago
- The FBAdLibrarian is a simple tool that can pull ad data and collects images offered by Facebook’s Ad Library API.☆15Updated 2 years ago
- Resources for economic research on data privacy☆15Updated 5 years ago
- pyspark-parallelised functions producing graph-theoretical metrics in connected component clusters for use in record-linkage (or other do…☆10Updated last year
- Evaluation and benchmarking of PatentsView disambiguation algorithms☆13Updated last year
- Resources for tackling record linkage / deduplication / data matching problems☆122Updated last year
- PyDST is a python module for accessing the API of Statistics Denmark. https://kristianuruplarsen.github.io/pydst/☆16Updated 2 weeks ago