wbsg-uni-mannheim / productbert-intermediateLinks
This repository contains code and data download scripts for the paper "Intermediate Training of BERT for Product Matching" by Ralph Peeters, Christian Bizer and Goran Glavaš.
☆38Updated 2 years ago
Alternatives and similar repositories for productbert-intermediate
Users that are interested in productbert-intermediate are comparing it to the libraries listed below
Sorting:
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆73Updated last year
- Creating class-based TF-IDF matrices☆90Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 3 years ago
- Explainable Zero-Shot Topic Extraction☆63Updated last year
- Information extraction pipeline containing coreference resolution, named entity linking, and relationship extraction☆81Updated 4 years ago
- Text-to-SQL in the Wild: A Naturally-Occurring Dataset Based on Stack Exchange Data☆103Updated 4 years ago
- KitanaQA: Adversarial training and data augmentation for neural question-answering models☆56Updated 2 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆63Updated last year
- How to encode sentences in a high-dimensional vector space, a.k.a., sentence embedding.☆134Updated 3 years ago
- A Corpus of 475,000 Industrial Occupations☆69Updated 4 years ago
- The code of Team Rhinobird for Mining the Web of HTML-embedded Product Data Task One at ISWC2020☆14Updated 5 years ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.☆36Updated 2 years ago
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext☆140Updated 5 months ago
- XAI based human-in-the-loop framework for automatic rule-learning.☆49Updated last year
- Developing a Knowledge Graph-based Question and Answering program to extract information from huge dataset☆95Updated 2 years ago
- A comprehensive tool for linguistic analysis of communities☆49Updated 3 years ago
- SImple SenTence EmbeddeR☆74Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- Custom Natural Language Processing with big and small models 🌲🌱☆68Updated 3 years ago
- Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers☆160Updated 4 years ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 4 years ago
- Google USE (Universal Sentence Encoder) for spaCy☆184Updated 2 years ago
- AI apps/benchmark for legaltech☆112Updated 3 years ago
- Few-shot Named Entity Recognition☆123Updated 3 years ago
- Topic Inference with Zeroshot models☆61Updated 2 years ago
- ☆18Updated 4 years ago
- Expose a Top2Vec model with a REST API.☆92Updated 2 years ago
- ☆43Updated 2 years ago
- Sentence transformers models for SpaCy☆107Updated 2 years ago