The dataset contains 3 million attribute-value annotations across 1257 unique categories on 2.2 million cleaned Amazon product profiles. It is a large, multi-sourced, diverse dataset for studying product attribute extraction.
☆151Dec 16, 2022Updated 3 years ago
Alternatives and similar repositories for MAVE
Users who are interested in MAVE are comparing it to the repositories listed below
- ACL19-Scaling Up Open Tagging from Tens to Thousands☆17Aug 23, 2019Updated 6 years ago
- A Pytorch implementation of "Scaling Up Open Tagging from Tens to Thousands: Comprehension Empowered Attribute Value Extraction from Prod…☆60Feb 18, 2020Updated 6 years ago
- Scaling Up Open Tagging from Tens to Thousands: Comprehension Empowered Attribute Value Extraction from Product Title☆84Sep 11, 2019Updated 6 years ago
- Unofficial implementation of the paper "OpenTag: Open Attribute Value Extraction from Product Profiles"☆33Aug 22, 2018Updated 7 years ago
- Codes and Datasets for the ACL2023 Findings Paper: FolkScope: Intention Knowledge Graph Construction for Discovering E-commerce Commonsen…☆39Mar 3, 2025Updated last year
- Code for paper OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision☆31May 9, 2022Updated 3 years ago
- Shopping Queries Dataset: A Large-Scale ESCI Benchmark for Improving Product Search☆346Oct 7, 2024Updated last year
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…☆23May 31, 2022Updated 3 years ago
- Large online shopping companies need to automatically populate their product descriptions supplied by the sellers. Many a times the text …☆11Jul 4, 2018Updated 7 years ago
- The implementation of our EMNLP 2021 paper "Enhanced Language Representation with Label Knowledge for Span Extraction".☆115May 22, 2023Updated 2 years ago
- A Chinese keyword extraction method based on pre-trained models (Chinese-language version of the code for the paper SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model)☆12May 17, 2020Updated 5 years ago
- Refined Commonsense Knowledge from Large-Scale Web Contents (TKDE 2022)☆12Oct 19, 2022Updated 3 years ago
- Ranking Models in Unlabeled New Environments (iccv21)☆16Aug 21, 2021Updated 4 years ago
- High-level Semantic Feature Detection: A New Perspective for Pedestrian Detection, CVPR, 2019☆13Aug 20, 2019Updated 6 years ago
- A Dataset for Conversational Recommendation over KnowledgeGraph in E-commerce☆51Sep 26, 2021Updated 4 years ago
- Aspect Term Extractor for the competition SemEval-2014 Task 4 on Aspect Based Sentiment Analysis☆14Jul 20, 2017Updated 8 years ago
- Easy to download and parse version of the Smartdoc 2015 - Challenge 1 dataset.☆14Mar 5, 2018Updated 8 years ago
- TER-plus Machine Translation metric.☆31May 23, 2022Updated 3 years ago
- This is the repository of code and dataset for paper "The Rise of Guardians: Fact-checking URL Recommendation to Combat Fake News", SIGIR…☆17Feb 19, 2022Updated 4 years ago
- OPUS (opus.nlpl.eu) Python3 API☆18Nov 23, 2024Updated last year
- This repository contains the code and data download links to reproduce the experiments of the PVLDB paper "Dual-Objective Fine-Tuning of …☆16Jun 7, 2021Updated 4 years ago
- [ACL 2020] Structure-Level Knowledge Distillation For Multilingual Sequence Labeling☆72Nov 23, 2022Updated 3 years ago
- The official repository for "Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP" published in ACL-IJNLP 2…☆20Apr 22, 2022Updated 3 years ago
- Faithfully Explainable Recommendation via Neural Logic Reasoning☆16May 3, 2021Updated 4 years ago
- Code for our EMNLP 2020 Paper "AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network"☆19Nov 14, 2022Updated 3 years ago
- Official Implementation of paper: Well Googled is Half Done: Multimodal Forecasting of New Fashion Product Sales with Image-based Google T…☆57Jul 23, 2022Updated 3 years ago
- Winner system (DAMO-NLP) of SemEval 2022 MultiCoNER shared task over 10 out of 13 tracks.☆186Jan 10, 2023Updated 3 years ago
- Code for ACL 2019 paper "Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs"☆18Feb 9, 2020Updated 6 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆21Oct 18, 2021Updated 4 years ago
- Official Implementation of Few-shot Visual Relationship Co-localization☆25Aug 25, 2021Updated 4 years ago