The dataset contains 3 million attribute-value annotations across 1257 unique categories on 2.2 million cleaned Amazon product profiles. It is a large, multi-sourced, diverse dataset for product attribute extraction study.
☆157Dec 16, 2022Updated 3 years ago
Alternatives and similar repositories for MAVE
Users that are interested in MAVE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆88Sep 15, 2020Updated 5 years ago
- Scaling Up Open Tagging from Tens to Thousands: Comprehension Empowered Attribute Value Extraction from Product Title☆84Sep 11, 2019Updated 6 years ago
- A Pytorch implementation of "Scaling Up Open Tagging from Tens to Thousands: Comprehension Empowered Attribute Value Extraction from Prod…☆62Feb 18, 2020Updated 6 years ago
- Unofficial implementation of the paper "OpenTag: Open Attribute Value Extraction from Product Profiles"☆33Aug 22, 2018Updated 7 years ago
- Attribute Value Extraction using Large Language Models☆28May 24, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆10Oct 19, 2020Updated 5 years ago
- Product Attributes Extraction in Indonesian e-Commerce Platform☆32Feb 28, 2022Updated 4 years ago
- Code for paper OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision☆30May 9, 2022Updated 4 years ago
- [ACL 2023] Codes and Datasets for Paper: FolkScope: Intention Knowledge Graph Construction for Discovering E-commerce Commonsense☆40Mar 3, 2025Updated last year
- [ACL-IJCNLP 2021] Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor☆11Jul 10, 2023Updated 2 years ago
- ☆11Sep 7, 2021Updated 4 years ago
- Code for paper "Conversational Product Search Based on Negative Feedback"☆12Jun 26, 2020Updated 5 years ago
- K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce (Findings of EMNLP …☆31Jan 6, 2023Updated 3 years ago
- The implementation our EMNLP 2021 paper "Enhanced Language Representation with Label Knowledge for Span Extraction".☆115May 22, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for our EMNLP 2020 Paper "AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network"☆20Nov 14, 2022Updated 3 years ago
- Learning Ontologies Via Embeddings☆12Jul 6, 2023Updated 2 years ago
- ☆11Oct 31, 2019Updated 6 years ago
- 基于预训练模型的中文关键词抽取方法(论文SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model 的中文版代码)☆12May 17, 2020Updated 6 years ago
- Zero-shot entity linking with less data☆15Aug 1, 2022Updated 3 years ago
- 基于多模态的属性抽取☆45Aug 6, 2020Updated 5 years ago
- ☆13Sep 5, 2021Updated 4 years ago
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa…☆17Mar 20, 2024Updated 2 years ago
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…☆24May 31, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- TER-plus Machine Translation metric.☆31May 23, 2022Updated 4 years ago
- This repository holds the annotated spreadsheet files, comprising the DECO dataset.☆13Mar 21, 2019Updated 7 years ago
- A Dataset for Conversational Recommendation over KnowledgeGraph in E-commerce☆51Sep 26, 2021Updated 4 years ago
- ☆22Sep 13, 2021Updated 4 years ago
- code for our WWW 2019 paper: "Open-world Learning and Application to Product Classification"☆37Apr 19, 2019Updated 7 years ago
- The 1st place solution for SIGIR 2020 E-Commerce Workshop Multimodal Product Classification Challenge☆21Aug 3, 2020Updated 5 years ago
- Learning Cross-modal Retrieval with Noisy Labels (CVPR 2021, PyTorch Code)☆13Apr 7, 2021Updated 5 years ago
- Code for ACL 2019 paper "Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs"☆18Feb 9, 2020Updated 6 years ago
- Resources for PVLDB 2023 submission☆28Aug 28, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Easy to download and parse version of the Smartdoc 2015 - Challenge 1 dataset.☆18Mar 5, 2018Updated 8 years ago
- ☆21Mar 25, 2023Updated 3 years ago
- ELIXIR: Learning from User Feedback on Explanations to Improve Recommender Models☆10Feb 15, 2021Updated 5 years ago
- Simplified DOM Trees for Transferable Attribute Extraction from the Web☆42Sep 27, 2024Updated last year
- ICLR 2019 paper: "textTOvec: DEEP CONTEXTUALIZED NEURAL AUTOREGRESSIVE TOPIC MODELS OF LANGUAGE WITH DISTRIBUTED COMPOSITIONAL PRIOR"☆25Dec 30, 2018Updated 7 years ago
- The official repository for "Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP" published in ACL-IJNLP 2…☆20Apr 22, 2022Updated 4 years ago
- The code for paper "Cross Platforms Linguals and Models Social Bot Detection via Federated Adversarial Contrastive Knowledge Distillation…☆20May 3, 2023Updated 3 years ago