DeNederlandscheBank / name_matching
Name matching is a Python package for the matching of company names. This package has been developed to match the names of companies from different databases together to allow them to be merged.
☆152Updated last week
Alternatives and similar repositories for name_matching
Users that are interested in name_matching are comparing it to the libraries listed below
Sorting:
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets.☆71Updated 2 months ago
- Name matching algorithm for company and people name in English☆13Updated last year
- 📛 Fuzzy Name Matching with Machine Learning☆264Updated 11 months ago
- Company Name Processor written in Python☆338Updated last year
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning☆118Updated last month
- Fast, flexible name matching for large datasets☆72Updated last year
- demo using FuzzyWuzzy matching company names☆75Updated 3 years ago
- ☆32Updated last month
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆113Updated 5 months ago
- Match Patent Assignees with Compustat and SDC via Bing Search☆50Updated 4 years ago
- MD&A sections from 10-Ks; 2002-2018☆34Updated 5 months ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆145Updated 7 months ago
- https://github.com/jcgcarranza/respol_patents_code☆34Updated 4 years ago
- ☆19Updated 3 years ago
- This repository provides updates and extended data following Kogan, L., Papanikolaou, D., Seru, A. and Stoffman, N., QJE 2017☆165Updated 7 months ago
- Innovation across ages☆69Updated 2 years ago
- ☆32Updated last year
- Super Fast String Matching in Python☆367Updated 2 months ago
- Functions for extracting commonly used linguistic features from text.☆11Updated 2 years ago
- Using Natural Language Processing to standardize Company Names☆12Updated 3 years ago
- Extract the Management Discussion and Analyses (MD&A) section from 10K Financial Statements☆71Updated 2 years ago
- Google Ticker Stock SVI (TS-SVI)☆12Updated 5 months ago
- Natural language processing tools developed by the World Bank's DECAT unit. A suite of text preprocessing and cleaning algorithms for NLP…☆10Updated 2 years ago
- Nesta's Skills Extractor Library☆135Updated 6 months ago
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT☆83Updated 6 months ago
- A mapping between SDCs M&A database and the gvkey's in Compustat☆82Updated 10 months ago
- Package that returns a company embedding given a company name☆45Updated 4 years ago
- Given a job title and job description, the algorithm assigns a standard occupational classification (SOC) code to the job.☆73Updated 10 months ago
- Record linking package that fuzzy matches two Python pandas dataframes using sqlite3 fts4☆283Updated 2 years ago
- This repo contains the link tables between ISIN and many other company/security identity codes.☆37Updated 2 weeks ago