wbsg-uni-mannheim / productbert-intermediate
This repository contains code and data download scripts for the paper "Intermediate Training of BERT for Product Matching" by Ralph Peeters, Christian Bizer and Goran Glavaš.
☆36Updated 2 years ago
Alternatives and similar repositories for productbert-intermediate:
Users who are interested in productbert-intermediate are comparing it to the libraries listed below.
- This repository contains code and data download instructions for the workshop paper "Improving Hierarchical Product Classification using …☆17Updated 3 years ago
- This repository contains the code to reproduce the experiments of the poster "Supervised Contrastive Learning for Product Matching"☆36Updated 2 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆18Updated 3 years ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆16Updated 3 years ago
- [KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding☆57Updated 3 years ago
- The code of Team Rhinobird for Mining the Web of HTML-embedded Product Data Task One at ISWC2020☆13Updated 4 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated 8 months ago
- Vespa application making an index of the CORD-19 dataset.☆39Updated this week
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆46Updated 6 years ago
- Code release for Type-Aware Bi-Encoders for Open-Domain Entity Retrieval☆19Updated 2 years ago
- KitanaQA: Adversarial training and data augmentation for neural question-answering models☆57Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated 10 months ago
- OptimSeed - Seed Word Selection for Weakly-Supervised Text Classification [NAACL SRW 2021]☆14Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆57Updated last year
- Explainable Zero-Shot Topic Extraction☆62Updated 5 months ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 2 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆70Updated 4 months ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated last week
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.☆25Updated 3 years ago
- Repository for the paper "Named Entity Recognition for Entity Linking: What Works and What's Next" (EMNLP 2021).☆75Updated 2 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition☆31Updated 2 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆104Updated 8 months ago
- No Teacher BART distillation experiment for NLI tasks☆26Updated 4 years ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆12Updated last year
- Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for impr…☆51Updated last year
- Creating class-based TF-IDF matrices☆82Updated 2 years ago
- Code and data from the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆65Updated 2 years ago