📛 Fuzzy Name Matching with Machine Learning
☆267Jun 17, 2024Updated last year
Alternatives and similar repositories for hmni
Users that are interested in hmni are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fast, flexible name matching for large datasets☆71Aug 29, 2025Updated 7 months ago
- demo using FuzzyWuzzy matching company names☆74Feb 22, 2022Updated 4 years ago
- This repository provides updates and extended data from Kelly, B., Papanikolaou, D., Seru, A. and Taddy, M., 2021. American Economic Revi…☆28Nov 29, 2023Updated 2 years ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆150Oct 16, 2024Updated last year
- Surprisingly Effective Way To Name Matching In Python These are the same product name and customer name but were taken as different form…☆39Mar 30, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Fuzzy string matching, grouping, and evaluation.☆794Jul 10, 2025Updated 9 months ago
- Python package to accelerate the sparse matrix multiplication and top-n similarity selection☆422Apr 9, 2026Updated last week
- Name matching is a Python package for the matching of company names. This package has been developed to match the names of companies from…☆166Apr 3, 2026Updated 2 weeks ago
- ☆19Oct 10, 2020Updated 5 years ago
- Abydos NLP/IR library for Python☆194Nov 10, 2022Updated 3 years ago
- Representation Learning of Entities and Documents from Knowledge Base Descriptions☆18Oct 6, 2018Updated 7 years ago
- A fast, precise, pure Python implementation of Fisher's exact test☆12Mar 27, 2017Updated 9 years ago
- German small and large versions of GPT2.☆20May 11, 2022Updated 3 years ago
- creating a dataset for person name disambiguation using combination of sources like wikipedia, DBLP authors and PPDB.☆52Jul 31, 2017Updated 8 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- DeText: A Deep Neural Text Understanding Framework for Ranking and Classification Tasks☆1,266Mar 2, 2023Updated 3 years ago
- ☆12Mar 20, 2020Updated 6 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆104Feb 26, 2024Updated 2 years ago
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets.☆92Mar 11, 2026Updated last month
- sketching algorithms implemented in chapel and python☆10Jun 8, 2017Updated 8 years ago
- PYthon Automated Term Extraction☆318Feb 8, 2023Updated 3 years ago
- This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.☆11May 7, 2020Updated 5 years ago
- Software that makes labeling PDFs easy.☆428May 13, 2024Updated last year
- A Shiny web app template using a dark theme with support for custom CSS☆13Feb 24, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Vector AI — A platform for building vector based applications. Encode, query and analyse data using vectors.☆320Mar 1, 2024Updated 2 years ago
- Updated source code examples from Lucene in Action. Educational purposes.☆16Apr 6, 2026Updated last week
- Start your journey into social media analysis of politicans by using Python (Tutorial)☆21Mar 26, 2019Updated 7 years ago
- Full-Shape Power Spectrum and Bispectrum Likelihoods☆13Apr 22, 2024Updated last year
- ☆17Sep 26, 2020Updated 5 years ago
- Deepparse is a state-of-the-art library for parsing multinational street addresses using deep learning☆338Apr 1, 2026Updated 2 weeks ago
- This database is a record of NYPD misconduct complaints made by the public to the Civilian Complaint Review Board (CCRB).☆13May 11, 2023Updated 2 years ago
- Slides from my talk on spaCy IRL, regarding sparse attention.☆12Jul 9, 2019Updated 6 years ago
- Name matching algorithm for company and people name in English☆15Dec 3, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Python package for performing Entity and Text Matching using Deep Learning.☆615Jun 18, 2024Updated last year
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆199Dec 18, 2022Updated 3 years ago
- Super Fast String Matching in Python☆368Mar 14, 2025Updated last year
- KitanaQA: Adversarial training and data augmentation for neural question-answering models☆56Jul 23, 2023Updated 2 years ago
- Build Semantic Search with S-BERT and Fine-tune your model in unsupervised way☆59Apr 26, 2022Updated 3 years ago
- Self-Supervision for Named Entity Disambiguation at the Tail☆218Jun 14, 2022Updated 3 years ago
- Repo contains Jupyter notebooks compiled during my review of the programming books listed.☆13Mar 9, 2022Updated 4 years ago