The dataset contains 3 million attribute-value annotations across 1257 unique categories on 2.2 million cleaned Amazon product profiles. It is a large, multi-sourced, diverse dataset for product attribute extraction study.
☆151Dec 16, 2022Updated 3 years ago
Alternatives and similar repositories for MAVE
Users that are interested in MAVE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ACL19-Scaling Up Open Tagging from Tens to Thousands☆17Aug 23, 2019Updated 6 years ago
- ☆88Sep 15, 2020Updated 5 years ago
- A Pytorch implementation of "Scaling Up Open Tagging from Tens to Thousands: Comprehension Empowered Attribute Value Extraction from Prod…☆61Feb 18, 2020Updated 6 years ago
- Unofficial implementation of the paper "OpenTag: Open Attribute Value Extraction from Product Profiles"☆33Aug 22, 2018Updated 7 years ago
- Attribute Value Extraction using Large Language Models☆28May 24, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆10Oct 19, 2020Updated 5 years ago
- Codes and Datasets for the ACL2023 Findings Paper: FolkScope: Intention Knowledge Graph Construction for Discovering E-commerce Commonsen…☆39Mar 3, 2025Updated last year
- Code for paper OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision☆31May 9, 2022Updated 3 years ago
- [ACL-IJCNLP 2021] Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor☆10Jul 10, 2023Updated 2 years ago
- ☆11Sep 7, 2021Updated 4 years ago
- The implementation our EMNLP 2021 paper "Enhanced Language Representation with Label Knowledge for Span Extraction".☆115May 22, 2023Updated 2 years ago
- Code for our EMNLP 2020 Paper "AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network"☆20Nov 14, 2022Updated 3 years ago
- ☆10Oct 31, 2019Updated 6 years ago
- 基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)☆12May 17, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Zero-shot entity linking with less data☆15Aug 1, 2022Updated 3 years ago
- ☆13Sep 5, 2021Updated 4 years ago
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…☆24May 31, 2022Updated 3 years ago
- TER-plus Machine Translation metric.☆31May 23, 2022Updated 3 years ago
- A Dataset for Conversational Recommendation over KnowledgeGraph in E-commerce☆51Sep 26, 2021Updated 4 years ago
- ☆22Sep 13, 2021Updated 4 years ago
- The source code is for the paper: “Triple Sequence Learning for Cross-domain Recommendation” accepted in TOIS by Haokai Ma, Ruobing Xie, …☆17Mar 1, 2024Updated 2 years ago
- Learning Cross-modal Retrieval with Noisy Labels (CVPR 2021, PyTorch Code)☆13Apr 7, 2021Updated 5 years ago
- clip retrieval benchmark☆17May 4, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for ACL 2019 paper "Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs"☆18Feb 9, 2020Updated 6 years ago
- Resources for PVLDB 2023 submission☆27Aug 28, 2024Updated last year
- Easy to download and parse version of the Smartdoc 2015 - Challenge 1 dataset.☆15Mar 5, 2018Updated 8 years ago
- 🌮 Table-based KB Completer☆16Mar 13, 2024Updated 2 years ago
- ELIXIR: Learning from User Feedback on Explanations to Improve Recommender Models☆10Feb 15, 2021Updated 5 years ago
- Simplified DOM Trees for Transferable Attribute Extraction from the Web☆40Sep 27, 2024Updated last year
- The official repository for "Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP" published in ACL-IJNLP 2…☆20Apr 22, 2022Updated 3 years ago
- The code for paper "Cross Platforms Linguals and Models Social Bot Detection via Federated Adversarial Contrastive Knowledge Distillation…☆20May 3, 2023Updated 2 years ago
- This is the repository of code and dataset for paper "The Rise of Guardians: Fact-checking URL Recommendation to Combat Fake News", SIGIR…☆17Feb 19, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- CrossWeigh: Training Named Entity Tagger from Imperfect Annotations☆176Jul 25, 2024Updated last year
- ☆18Mar 3, 2023Updated 3 years ago
- ☆37Sep 22, 2021Updated 4 years ago
- Iterative Rank-Aware Open IE☆30Jun 24, 2019Updated 6 years ago
- Large Multimodal Model☆15Apr 8, 2024Updated 2 years ago
- Words and their images in 98 languages☆14Mar 1, 2019Updated 7 years ago
- OPUS (opus.nlpl.eu) Python3 API☆18Nov 23, 2024Updated last year