wbsg-uni-mannheim / productbert-intermediate
This repository contains code and data download scripts for the paper "Intermediate Training of BERT for Product Matching" by Ralph Peeters, Christian Bizer and Goran Glavaš.
☆37 · Updated 2 years ago
Alternatives and similar repositories for productbert-intermediate:
Users interested in productbert-intermediate are comparing it to the repositories listed below.
- This repository contains the code to reproduce the experiments of the poster "Supervised Contrastive Learning for Product Matching" ☆38 · Updated 3 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021 ☆19 · Updated 3 years ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations" ☆16 · Updated 3 years ago
- The code of Team Rhinobird for Mining the Web of HTML-embedded Product Data Task One at ISWC2020 ☆13 · Updated 4 years ago
- This repository contains code and data download instructions for the workshop paper "Improving Hierarchical Product Classification using … ☆17 · Updated 3 years ago
- KitanaQA: Adversarial training and data augmentation for neural question-answering models ☆57 · Updated last year
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o… ☆46 · Updated 6 years ago
- A Corpus of 475,000 Industrial Occupations ☆66 · Updated 4 years ago
- ☆30 · Updated 2 years ago
- Bi-encoder Based Entity Linking Tutorial. You can run experiment only in 5 minutes. Experiments on Co-lab pro GPU are also supported! ☆34 · Updated 3 years ago
- Repository for the paper "Named Entity Recognition for Entity Linking: What Works and What's Next" (EMNLP 2021). ☆75 · Updated 3 years ago
- ☆32 · Updated 3 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020 ☆62 · Updated 10 months ago
- OptimSeed - Seed Word Selection for Weakly-Supervised Text Classification [NAACL SRW 2021] ☆14 · Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆58 · Updated last year
- Code for the paper "Deep Entity Matching with Pre-trained Language Models" ☆270 · Updated 11 months ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ☆37 · Updated 2 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition ☆31 · Updated 3 years ago
- [KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding ☆57 · Updated 4 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆80 · Updated 2 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs). ☆70 · Updated 7 months ago
- Package that returns a company embedding given a company name ☆45 · Updated 4 years ago
- doccano auto labeling pipeline helps doccano to annotate a document automatically. ☆42 · Updated last year
- Developing a Knowledge Graph-based Question and Answering program to extract information from huge dataset ☆96 · Updated last year
- simple rule based named entity recognition ☆43 · Updated 3 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking ☆85 · Updated 2 years ago
- Creating class-based TF-IDF matrices ☆83 · Updated 2 years ago
- Table2Vec: Neural Word and Entity Embeddings for Table Population and Retrieval ☆23 · Updated 6 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space … ☆31 · Updated last year
- ⚖️ Neural network for product matching, aka classifying whether two product titles represent the same entity ☆66 · Updated last year