wbsg-uni-mannheim / productbert-intermediate
This repository contains code and data download scripts for the paper "Intermediate Training of BERT for Product Matching" by Ralph Peeters, Christian Bizer and Goran Glavaš.
☆38 Updated 2 years ago
Alternatives and similar repositories for productbert-intermediate
Users who are interested in productbert-intermediate are comparing it to the libraries listed below.
- KitanaQA: Adversarial training and data augmentation for neural question-answering models ☆56 Updated 2 years ago
- Explainable Zero-Shot Topic Extraction ☆63 Updated last year
- Named entity recognizer based on ELMo or BERT as feature extractor and CRF as final classifier ☆80 Updated 2 years ago
- Creating class-based TF-IDF matrices ☆89 Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆80 Updated 3 years ago
- Expose a Top2Vec model with a REST API. ☆92 Updated 2 years ago
- This repository contains code and data download instructions for the workshop paper "Improving Hierarchical Product Classification using … ☆17 Updated 4 years ago
- Code and data from the paper BERT Got a Date: Introducing Transformers to Temporal Tagging ☆68 Updated 3 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs). ☆73 Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 Updated last year
- How to encode sentences in a high-dimensional vector space, a.k.a., sentence embedding. ☆134 Updated 3 years ago
- AI apps/benchmark for legaltech ☆112 Updated 4 years ago
- A Corpus of 475,000 Industrial Occupations ☆69 Updated 4 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆58 Updated 2 years ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently. ☆36 Updated 2 years ago
- XAI based human-in-the-loop framework for automatic rule-learning. ☆49 Updated last year
- Custom Natural Language Processing with big and small models 🌲🌱 ☆68 Updated 4 years ago
- SImple SenTence EmbeddeR ☆74 Updated 2 years ago
- Information extraction pipeline containing coreference resolution, named entity linking, and relationship extraction ☆81 Updated 4 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficiently… ☆108 Updated last year
- Topic Inference with Zeroshot models ☆61 Updated 2 years ago
- This repository contains the code to reproduce the experiments of the poster "Supervised Contrastive Learning for Product Matching" ☆38 Updated 3 years ago
- Sentence transformers models for SpaCy ☆107 Updated 2 years ago
- Exploring NLP weak supervision approaches to train text classification models. The project is also a prototype for a semi-automated text … ☆22 Updated last year
- ☆30 Updated 3 years ago
- Self-Supervision for Named Entity Disambiguation at the Tail ☆219 Updated 3 years ago
- Data Programming by Demonstration (DPBD) for Document Classification ☆35 Updated 4 years ago
- A library to synthesize text datasets using Large Language Models (LLM) ☆152 Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… ☆41 Updated 3 years ago
- The code of Team Rhinobird for Mining the Web of HTML-embedded Product Data Task One at ISWC2020 ☆14 Updated 5 years ago