anhaidgroup / deepmatcher
Python package for performing Entity and Text Matching using Deep Learning.
☆580Updated 8 months ago
Alternatives and similar repositories for deepmatcher:
Users that are interested in deepmatcher are comparing it to the libraries listed below
- Code for the paper "Deep Entity Matching with Pre-trained Language Models"☆270Updated 10 months ago
- ☆188Updated 9 months ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆136Updated 7 months ago
- Code and data for Sato https://arxiv.org/abs/1911.06311.☆112Updated last year
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆46Updated 6 years ago
- This repository contains the code and data download links to reproduce the experiments of the PVLDB paper "Dual-Objective Fine-Tuning of …☆14Updated 3 years ago
- Entity resolution using zero labeled examples☆28Updated 8 months ago
- Self-Supervision for Named Entity Disambiguation at the Tail☆215Updated 2 years ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆986Updated last year
- REL: Radboud Entity Linker☆306Updated 11 months ago
- Fuzzy string matching, grouping, and evaluation.☆752Updated 3 weeks ago
- ☆32Updated 3 years ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆397Updated 3 years ago
- Compute Sentence Embeddings Fast!☆621Updated 2 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆31Updated last year
- skweak: A software toolkit for weak supervision applied to NLP tasks☆923Updated 6 months ago
- Fuzzy matching and more functionality for spaCy.☆255Updated 8 months ago
- Entity Linker solution☆1,184Updated last year
- This repository contains source code for the TaBERT model, a pre-trained language model for learning joint representations of natural lan…☆594Updated 3 years ago
- PYthon Automated Term Extraction☆311Updated 2 years ago
- Anserini is a Lucene toolkit for reproducible information retrieval research☆1,046Updated last week
- A list of free data matching and record linkage software.☆378Updated last year
- Recent trends of Entity Linking, Disambiguation, and Representation.☆345Updated 3 years ago
- Entity Disambiguation as text extraction (ACL 2022)☆181Updated 2 years ago
- Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents☆284Updated last year
- Autoregressive Entity Retrieval☆781Updated last year
- Character-based word embeddings model based on RNN for handling real world texts☆173Updated last year
- This repository contains code and data download scripts for the paper "Using schema.org annotations for training and maintaining product …☆15Updated last year
- A Flexible Deep Learning Approach to Fuzzy String Matching☆144Updated 4 months ago
- ☆15Updated 8 months ago