The dataset contains 3 million attribute-value annotations across 1257 unique categories on 2.2 million cleaned Amazon product profiles. It is a large, multi-sourced, diverse dataset for research on product attribute extraction.
☆151Dec 16, 2022Updated 3 years ago
Alternatives and similar repositories for MAVE
Users that are interested in MAVE are comparing it to the libraries listed below.
- ACL19-Scaling Up Open Tagging from Tens to Thousands☆17Aug 23, 2019Updated 6 years ago
- ☆88Sep 15, 2020Updated 5 years ago
- Scaling Up Open Tagging from Tens to Thousands: Comprehension Empowered Attribute Value Extraction from Product Title☆84Sep 11, 2019Updated 6 years ago
- A Pytorch implementation of "Scaling Up Open Tagging from Tens to Thousands: Comprehension Empowered Attribute Value Extraction from Prod…☆61Feb 18, 2020Updated 6 years ago
- Unofficial implementation of the paper "OpenTag: Open Attribute Value Extraction from Product Profiles"☆33Aug 22, 2018Updated 7 years ago
- Attribute Value Extraction using Large Language Models☆28May 24, 2024Updated last year
- Codes and Datasets for the ACL2023 Findings Paper: FolkScope: Intention Knowledge Graph Construction for Discovering E-commerce Commonsen…☆39Mar 3, 2025Updated last year
- Shopping Queries Dataset: A Large-Scale ESCI Benchmark for Improving Product Search☆348Oct 7, 2024Updated last year
- Code for paper OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision☆31May 9, 2022Updated 3 years ago
- ☆11Sep 7, 2021Updated 4 years ago
- ☆14Apr 18, 2020Updated 5 years ago
- Code for paper "Conversational Product Search Based on Negative Feedback"☆12Jun 26, 2020Updated 5 years ago
- K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce (Findings of EMNLP …☆31Jan 6, 2023Updated 3 years ago
- The implementation of our EMNLP 2021 paper "Enhanced Language Representation with Label Knowledge for Span Extraction".☆115May 22, 2023Updated 2 years ago
- Learning Ontologies Via Embeddings☆12Jul 6, 2023Updated 2 years ago
- Large online shopping companies need to automatically populate their product descriptions supplied by the sellers. Many a times the text …☆11Jul 4, 2018Updated 7 years ago
- ☆10Oct 31, 2019Updated 6 years ago
- Zero-shot entity linking with less data☆15Aug 1, 2022Updated 3 years ago
- Multimodal attribute extraction☆45Aug 6, 2020Updated 5 years ago
- ☆13Sep 5, 2021Updated 4 years ago
- This is the official repository of the revised datasets FUNSD-r and CORD-r, introduced in EMNLP 2023 paper Reading Order Matters: Informa…☆17Mar 20, 2024Updated 2 years ago
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…☆24May 31, 2022Updated 3 years ago
- [ACL 2020] Structure-Level Knowledge Distillation For Multilingual Sequence Labeling☆72Nov 23, 2022Updated 3 years ago
- This repository holds the annotated spreadsheet files, comprising the DECO dataset.☆13Mar 21, 2019Updated 7 years ago
- ☆22Sep 13, 2021Updated 4 years ago
- code for our WWW 2019 paper: "Open-world Learning and Application to Product Classification"☆37Apr 19, 2019Updated 6 years ago
- The 1st place solution for SIGIR 2020 E-Commerce Workshop Multimodal Product Classification Challenge☆21Aug 3, 2020Updated 5 years ago
- Learning Cross-modal Retrieval with Noisy Labels (CVPR 2021, PyTorch Code)☆13Apr 7, 2021Updated 4 years ago
- Code for ACL 2019 paper "Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs"☆18Feb 9, 2020Updated 6 years ago
- Resources for PVLDB 2023 submission☆27Aug 28, 2024Updated last year
- 🌮 Table-based KB Completer☆16Mar 13, 2024Updated 2 years ago
- ELIXIR: Learning from User Feedback on Explanations to Improve Recommender Models☆10Feb 15, 2021Updated 5 years ago
- Simplified DOM Trees for Transferable Attribute Extraction from the Web☆40Sep 27, 2024Updated last year
- High-level Semantic Feature Detection: A New Perspective for Pedestrian Detection, CVPR, 2019☆13Aug 20, 2019Updated 6 years ago
- ICLR 2019 paper: "textTOvec: DEEP CONTEXTUALIZED NEURAL AUTOREGRESSIVE TOPIC MODELS OF LANGUAGE WITH DISTRIBUTED COMPOSITIONAL PRIOR"☆25Dec 30, 2018Updated 7 years ago
- This is the repository of code and dataset for paper "The Rise of Guardians: Fact-checking URL Recommendation to Combat Fake News", SIGIR…☆17Feb 19, 2022Updated 4 years ago
- The code for paper "Cross Platforms Linguals and Models Social Bot Detection via Federated Adversarial Contrastive Knowledge Distillation…☆20May 3, 2023Updated 2 years ago
- CrossWeigh: Training Named Entity Tagger from Imperfect Annotations☆176Jul 25, 2024Updated last year
- ☆18Mar 3, 2023Updated 3 years ago