This repository contains code and data download scripts for the paper "Intermediate Training of BERT for Product Matching" by Ralph Peeters, Christian Bizer and Goran Glavaš.
☆38Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for productbert-intermediate
Users that are interested in productbert-intermediate are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The code of Team Rhinobird for Mining the Web of HTML-embedded Product Data Task One at ISWC2020☆14Aug 26, 2020Updated 5 years ago
- This repository contains the code to reproduce the experiments of the poster "Supervised Contrastive Learning for Product Matching"☆38Feb 11, 2022Updated 4 years ago
- 🌠Product matching model for an eCommerce platform using FastText, Simple LSTM, Siamese MaLSTM☆51Jul 13, 2019Updated 6 years ago
- This repository contains the code and data download links to reproduce building the WDC Products Benchmark.☆15Jul 13, 2023Updated 2 years ago
- DEREK (Domain Entities and Relations Extraction Kit)☆10May 22, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Python package for performing Entity and Text Matching using Deep Learning.☆615Jun 18, 2024Updated last year
- Source code for ICDE 2020 paper Collective Entity Alignment via Adaptive Features (CEA).☆16Jun 10, 2020Updated 5 years ago
- A simple project that trains an OpenNLP Named Entity Recognition model to identify ingredients in a recipe.☆14Oct 30, 2016Updated 9 years ago
- the implementation of "Entity Resolution via Hierarchical Graph Attention Network"☆24Aug 29, 2023Updated 2 years ago
- ☆13Sep 28, 2020Updated 5 years ago
- A semantic food search web application built with Django, Solr, SBERT, and Docker☆10Apr 14, 2025Updated last year
- Description of Final Assignments of Information Retrieval Course 2021-2022☆13Dec 22, 2021Updated 4 years ago
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆17Dec 20, 2021Updated 4 years ago
- ☆18Sep 16, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ELT for AEMET weather data.☆16Mar 23, 2025Updated last year
- Code for the paper "Deep Entity Matching with Pre-trained Language Models"☆309Apr 17, 2024Updated 2 years ago
- Материалы к курсу по Knowledge Graphs☆25Aug 4, 2025Updated 9 months ago
- Implementation of CIKM'18 paper: "MEgo2Vec: Embedding Matched Ego Networks for User Alignment Across Social Networks∗".☆32Jul 12, 2021Updated 4 years ago
- Implementation of N-Grammer in Flax☆17Nov 3, 2022Updated 3 years ago
- A Python Library for Pushbullet.☆13Jun 26, 2022Updated 3 years ago
- Colab "Jukebox: A Generative Model for Music"☆16Jun 14, 2020Updated 5 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆21Oct 18, 2021Updated 4 years ago
- [ACL 2024] Dataset and Code of "ImplicitAVE: An Open-Source Dataset and Multimodal LLMs Benchmark for Implicit Attribute Value Extraction…☆16Jun 10, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆33Feb 14, 2019Updated 7 years ago
- Personalized Purchase Prediction of Market Baskets with Wasserstein-Based Sequence Matching☆19Aug 9, 2019Updated 6 years ago
- Streamlit apps on Cloud Run with Identity-Aware Proxy (IAP).☆10Mar 5, 2022Updated 4 years ago
- A crowdsourced list of public datasets on the topic of Food☆26Jan 22, 2018Updated 8 years ago
- Weekly assignment solutions passed with 100/100☆11Feb 5, 2017Updated 9 years ago
- A clinical research interface geared at collecting robust and consistent data by providing a strong framework for designing data dictiona…☆12Sep 25, 2025Updated 7 months ago
- Surfaces nutritional data for products on Rewe.de and Amazon (DE/UK)☆25Apr 23, 2026Updated 2 weeks ago
- LSHDB is a parallel and distributed data engine, which relies on Locality-Sensitive Hashing and noSQL systems, for performing record link…☆34Aug 30, 2022Updated 3 years ago
- django extension which uses telethon to integrate telegram client authorization (phone+code) to your project☆13Apr 15, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Repository for tasks from FAIO(2024-current)☆28Nov 12, 2025Updated 5 months ago
- Cleaning an image to obtain a more usable document using Python☆14Sep 12, 2019Updated 6 years ago
- ☆10Oct 28, 2019Updated 6 years ago
- Low-code pre-built pipelines for experiments with huggingface/transformers for Data Scientists in a rush.☆16Oct 14, 2020Updated 5 years ago
- vIPer: a new tool for IPython notebooks.☆60Jan 7, 2015Updated 11 years ago
- A simple template for TensorFlow's highly efficient CudnnLSTM module☆11Jun 8, 2018Updated 7 years ago
- Use pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story.☆13Oct 2, 2019Updated 6 years ago