Entity Matching Model solves the problem of matching company names between two possibly very large datasets.
☆92Mar 11, 2026Updated last month
Alternatives and similar repositories for EntityMatchingModel
Users that are interested in EntityMatchingModel are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyCodeHash is a generic data and code hashing library that facilitates downstream caching.☆13Jan 26, 2026Updated 2 months ago
- Name matching is a Python package for the matching of company names. This package has been developed to match the names of companies from…☆164Apr 3, 2026Updated last week
- A Python 3.7 package for the econometric analysis of networks☆22Oct 10, 2024Updated last year
- Blocking records for record linkage and data deduplication based on ANN algorithms in Python.☆20Mar 9, 2026Updated last month
- Company names matching: match company names to legal names and stock symbols☆18Aug 8, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Name matching algorithm for company and people name in English☆15Dec 3, 2023Updated 2 years ago
- A collaborative collection of structured datasets and document collections that are common to use within "Follow the Money" investigation…☆14Apr 1, 2026Updated 2 weeks ago
- Individual claims history simulation machine☆19Sep 25, 2019Updated 6 years ago
- A workflow framework for statistical package development☆66Updated this week
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆511Jan 9, 2026Updated 3 months ago
- 30 key split keyboard☆13May 7, 2024Updated last year
- A Python library for defining rule-based overrides on messy data☆18Nov 24, 2025Updated 4 months ago
- Code for "Fine-Tuned 'Small' LLMs (Still) Significantly Outperform Zero-Shot Generative AI Models in Text Classification", arXiv 2024☆14Jun 24, 2024Updated last year
- Code that accompanies the PyData New York (2022) talk: Addressing the sensitivity of Large language models☆13Nov 7, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Fast, flexible name matching for large datasets☆71Aug 29, 2025Updated 7 months ago
- Python package for performing Entity and Text Matching using Deep Learning.☆615Jun 18, 2024Updated last year
- demo using FuzzyWuzzy matching company names☆75Feb 22, 2022Updated 4 years ago
- Create an Anime database containing all the Anime currently available on the website, which includes: 'Anime Title', 'Description', 'C…☆12Jun 10, 2020Updated 5 years ago
- Python package to accelerate the sparse matrix multiplication and top-n similarity selection☆422Apr 9, 2026Updated last week
- causalweight: An R Package for Causal Inference and Mediation Analysis☆15Mar 6, 2019Updated 7 years ago
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated 10 months ago
- ATLAS (All The Locations of All Servers) - Global data center mapping project with 6,266+ verified locations across 155 countries. Com…☆47Oct 13, 2025Updated 6 months ago
- 30/36 key 3row keyboards with col stagger☆17Aug 28, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A static file containing a list of popular RSS feeds.☆13Aug 25, 2016Updated 9 years ago
- Python Interface for querying WRDS datasets (CRSP, COMPUSTAT)☆11Mar 15, 2014Updated 12 years ago
- Podcast index database quality dashboard☆15Mar 15, 2026Updated last month
- this is a Manual Named-Entities/Part-of-speech Tagger for Spacy, You can use it to create your own training datasets.☆12Jun 16, 2018Updated 7 years ago
- Autonomous Development System for Claude Code☆37Mar 14, 2026Updated last month
- Python module for measure the degree of association between variables☆13Apr 20, 2022Updated 3 years ago
- 🦋 Small scripts and tools to do data stuff with the AT Protocol.☆13Nov 28, 2024Updated last year
- ETL-10-K-Filings is a Python-based open-source project designed for ETL of financial data from SEC Edgar filings. Focusing on the MDA Sec…☆16Feb 11, 2024Updated 2 years ago
- Efficient implementation of Learning Time-Series Shapelets using keras☆25Aug 29, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Audio feature extraction and baseline search implementation for the Spotify Podcast Dataset.☆12Sep 30, 2021Updated 4 years ago
- ☆17Dec 15, 2023Updated 2 years ago
- Duke Machine Learning Winter School: Computer Vision 2022☆10Jan 3, 2022Updated 4 years ago
- Calendars for various securities exchanges.☆14Dec 18, 2020Updated 5 years ago
- Data from paper: "Benign Effects of Automation: New Evidence from Patent Texts"☆12May 31, 2025Updated 10 months ago
- Using Natural Language Processing to standardize Company Names☆11Aug 4, 2021Updated 4 years ago
- A wrapper around Python's ctypes for Nim-specific function signatures.☆12Dec 12, 2017Updated 8 years ago