wbsg-uni-mannheim / productbert-intermediateLinks
This repository contains code and data download scripts for the paper "Intermediate Training of BERT for Product Matching" by Ralph Peeters, Christian Bizer and Goran Glavaš.
☆38Updated 2 years ago
Alternatives and similar repositories for productbert-intermediate
Users that are interested in productbert-intermediate are comparing it to the libraries listed below
Sorting:
- KitanaQA: Adversarial training and data augmentation for neural question-answering models☆57Updated last year
- Explainable Zero-Shot Topic Extraction☆63Updated 10 months ago
- ☆43Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated 2 years ago
- This repository contains code and data download instructions for the workshop paper "Improving Hierarchical Product Classification using …☆17Updated 4 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 3 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition☆31Updated 3 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆71Updated 10 months ago
- Information extraction pipeline containing coreference resolution, named entity linking, and relationship extraction☆81Updated 4 years ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.☆36Updated 2 years ago
- XAI based human-in-the-loop framework for automatic rule-learning.☆49Updated last year
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated last year
- Creating class-based TF-IDF matrices☆84Updated 2 years ago
- Template Extraction from unstructured Wikipedia text using NLP techniques.☆41Updated 5 years ago
- Custom Natural Language Processing with big and small models 🌲🌱☆68Updated 3 years ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 4 years ago
- A comprehensive tool for linguistic analysis of communities☆49Updated 3 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆20Updated 3 years ago
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- Data Programming by Demonstration (DPBD) for Document Classification☆35Updated 4 years ago
- Developing a Knowledge Graph-based Question and Answering program to extract information from huge dataset☆96Updated 2 years ago
- STriP Net: Semantic Similarity of Scientific Papers (S3P) Network☆86Updated 3 years ago
- Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for impr…☆52Updated last year
- Low-code pre-built pipelines for experiments with huggingface/transformers for Data Scientists in a rush.☆16Updated 4 years ago
- Topic Inference with Zeroshot models☆61Updated 2 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently…☆108Updated 10 months ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆36Updated 3 years ago
- This repository contains the code to reproduce the experiments of the poster "Supervised Contrastive Learning for Product Matching"☆39Updated 3 years ago