DeNederlandscheBank / name_matching
Name matching is a Python package for the matching of company names. This package has been developed to match the names of companies from different databases together to allow them to be merged.
☆143Updated this week
Alternatives and similar repositories for name_matching:
Users that are interested in name_matching are comparing it to the libraries listed below
- Company Name Processor written in Python☆332Updated 9 months ago
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets.☆67Updated 2 weeks ago
- demo using FuzzyWuzzy matching company names☆73Updated 2 years ago
- Fast, flexible name matching for large datasets☆70Updated last year
- TFIDF / KNN based string matching☆51Updated last year
- Super Fast String Matching in Python☆363Updated last week
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning☆113Updated 3 weeks ago
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆109Updated 2 months ago
- ☆30Updated last month
- 📛 Fuzzy Name Matching with Machine Learning☆262Updated 7 months ago
- Name matching algorithm for company and people name in English☆13Updated last year
- Using Natural Language Processing to standardize Company Names☆12Updated 3 years ago
- https://github.com/jcgcarranza/respol_patents_code☆30Updated 4 years ago
- Fuzzy matches and merging of datasets in pandas using csvmatch☆74Updated 4 years ago
- Nesta's Skills Extractor Library☆126Updated 3 months ago
- ☆31Updated last year
- Match Patent Assignees with Compustat and SDC via Bing Search☆48Updated 4 years ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆257Updated 3 months ago
- Package that returns a company embedding given a company name☆42Updated 4 years ago
- Python package to accelerate the sparse matrix multiplication and top-n similarity selection☆399Updated 2 months ago
- This repository provides updates and extended data following Kogan, L., Papanikolaou, D., Seru, A. and Stoffman, N., QJE 2017☆154Updated 4 months ago
- MD&A sections from 10-Ks; 2002-2018☆33Updated 2 months ago
- ☆19Updated 3 years ago
- Resources for tackling record linkage / deduplication / data matching problems☆116Updated 11 months ago
- Fuzzy matching and more functionality for spaCy.☆254Updated 7 months ago
- Functions for extracting commonly used linguistic features from text.☆11Updated 2 years ago
- Python implementation for scraping daily GoogleTrends data over long time periods☆47Updated 10 months ago
- A series of Jupyter Notebooks that demonstrate how to scrape data from the S&P Capital IQ Website, provided that you already have access …☆17Updated 5 years ago
- Simplifies use of the Dedupe library via Pandas☆135Updated last year
- This repo contains the link tables between ISIN and many other company/security identity codes.☆31Updated 4 months ago