ArjitJ / DIALLinks
Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"
☆17Updated 3 years ago
Alternatives and similar repositories for DIAL
Users that are interested in DIAL are comparing it to the libraries listed below
Sorting:
- To reproduce experiments of the paper "Entity Matching with Transformer Architectures"☆27Updated 6 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆32Updated 2 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition☆31Updated 3 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆20Updated 4 years ago
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆47Updated 7 years ago
- Model for learning document embeddings along with their uncertainties☆36Updated 2 years ago
- Bi-encoder Based Entity Linking Tutorial. You can run experiment only in 5 minutes. Experiments on Co-lab pro GPU are also supported!☆34Updated 4 years ago
- PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolut…☆159Updated 3 years ago
- [KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding☆58Updated 4 years ago
- This repository contains the code to reproduce the experiments of the poster "Supervised Contrastive Learning for Product Matching"☆38Updated 3 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆63Updated last year
- DECAF: Deep Extreme Classification with Label Features☆54Updated 3 years ago
- ☆32Updated 4 years ago
- ☆34Updated 2 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆35Updated last year
- RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!☆42Updated 2 years ago
- Tutorial and hands-on notebook on using the Knowledge Graph Toolkit (KGTK)☆82Updated 3 years ago
- Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval☆19Updated 3 years ago
- Combining encoder-based language models☆11Updated 4 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆76Updated last month
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Updated last year
- spaCy match and replace, maintaining conjugation☆36Updated 3 years ago
- ☆40Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆30Updated 10 months ago
- ☆59Updated 4 years ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆14Updated 2 years ago
- A set of methods for finding an appropriate number of topics in a text collection☆16Updated 7 months ago
- string embed for fast edit distance computation, codes for [Convolutional Embedding for Edit Distance (SIGIR 20)].☆63Updated 2 years ago
- OptimSeed - Seed Word Selection for Weakly-Supervised Text Classification [NAACL SRW 2021]☆14Updated 4 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆87Updated 3 years ago