ArjitJ / DIALLinks
Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"
☆17Updated 3 years ago
Alternatives and similar repositories for DIAL
Users that are interested in DIAL are comparing it to the libraries listed below
Sorting:
- To reproduce experiments of the paper "Entity Matching with Transformer Architectures"☆27Updated 6 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition☆31Updated 3 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆32Updated 2 years ago
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆47Updated 7 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆20Updated 4 years ago
- PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolut…☆157Updated 3 years ago
- Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval☆19Updated 3 years ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆14Updated 2 years ago
- ☆59Updated 4 years ago
- OptimSeed - Seed Word Selection for Weakly-Supervised Text Classification [NAACL SRW 2021]☆14Updated 4 years ago
- ☆34Updated 2 years ago
- ☆32Updated 4 years ago
- RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!☆42Updated 2 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Updated last year
- Data Programming by Demonstration (DPBD) for Document Classification☆35Updated 4 years ago
- Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge☆17Updated 4 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆63Updated last year
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆35Updated last year
- A few-shot learning method based on siamese networks.☆28Updated 2 years ago
- [KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding☆58Updated 4 years ago
- Python package for deduplication/entity resolution using active learning☆82Updated last year
- Source code and data for Like a Good Nearest Neighbor☆30Updated 10 months ago
- DECAF: Deep Extreme Classification with Label Features☆54Updated 3 years ago
- [KDD 2020] This is the code repository for our KDD'20 paper STEAM: Self-Supervised Taxonomy Expansion with Mini-Paths.☆18Updated 5 years ago
- Knowledge Base Embedding By Cooperative Knowledge Distillation☆67Updated 3 years ago
- KitanaQA: Adversarial training and data augmentation for neural question-answering models☆56Updated 2 years ago
- Data programming by demonstration for information extraction and span annotation☆35Updated 4 years ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆27Updated 4 years ago
- Tutorial and hands-on notebook on using the Knowledge Graph Toolkit (KGTK)☆81Updated 3 years ago
- Combining encoder-based language models☆11Updated 4 years ago