The dataset contains 3 million attribute-value annotations across 1257 unique categories on 2.2 million cleaned Amazon product profiles. It is a large, diverse, multi-source dataset for research on product attribute value extraction.
☆155 · Dec 16, 2022 · Updated 3 years ago
Alternatives and similar repositories for MAVE
Users that are interested in MAVE are comparing it to the libraries listed below.
- ACL19-Scaling Up Open Tagging from Tens to Thousands ☆17 · Aug 23, 2019 · Updated 6 years ago
- ☆88 · Sep 15, 2020 · Updated 5 years ago
- Scaling Up Open Tagging from Tens to Thousands: Comprehension Empowered Attribute Value Extraction from Product Title ☆85 · Sep 11, 2019 · Updated 6 years ago
- A PyTorch implementation of "Scaling Up Open Tagging from Tens to Thousands: Comprehension Empowered Attribute Value Extraction from Prod… ☆62 · Feb 18, 2020 · Updated 6 years ago
- Attribute Value Extraction using Large Language Models ☆28 · May 24, 2024 · Updated 2 years ago
- Shopping Queries Dataset: A Large-Scale ESCI Benchmark for Improving Product Search ☆364 · Apr 8, 2026 · Updated last month
- Code for paper OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision ☆31 · May 9, 2022 · Updated 4 years ago
- [ACL 2023] Codes and Datasets for Paper: FolkScope: Intention Knowledge Graph Construction for Discovering E-commerce Commonsense ☆41 · Mar 3, 2025 · Updated last year
- [ACL-IJCNLP 2021] Structural Knowledge Distillation: Tractably Distilling Information for Structured Predictor ☆10 · Jul 10, 2023 · Updated 2 years ago
- ☆11 · Sep 7, 2021 · Updated 4 years ago
- ☆14 · Apr 18, 2020 · Updated 6 years ago
- Code for paper "Conversational Product Search Based on Negative Feedback" ☆12 · Jun 26, 2020 · Updated 5 years ago
- K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce (Findings of EMNLP … ☆31 · Jan 6, 2023 · Updated 3 years ago
- The implementation of our EMNLP 2021 paper "Enhanced Language Representation with Label Knowledge for Span Extraction". ☆115 · May 22, 2023 · Updated 3 years ago
- Code for our EMNLP 2020 Paper "AIN: Fast and Accurate Sequence Labeling with Approximate Inference Network" ☆20 · Nov 14, 2022 · Updated 3 years ago
- Learning Ontologies Via Embeddings ☆12 · Jul 6, 2023 · Updated 2 years ago
- Large online shopping companies need to automatically populate their product descriptions supplied by the sellers. Many a times the text … ☆11 · Jul 4, 2018 · Updated 7 years ago
- ☆10 · Oct 31, 2019 · Updated 6 years ago
- Zero-shot entity linking with less data ☆15 · Aug 1, 2022 · Updated 3 years ago
- Multimodal attribute extraction ☆45 · Aug 6, 2020 · Updated 5 years ago
- [ACL 2020] Structure-Level Knowledge Distillation For Multilingual Sequence Labeling ☆73 · Nov 23, 2022 · Updated 3 years ago
- TER-plus Machine Translation metric. ☆31 · May 23, 2022 · Updated 4 years ago
- A Dataset for Conversational Recommendation over KnowledgeGraph in E-commerce ☆51 · Sep 26, 2021 · Updated 4 years ago
- ☆22 · Sep 13, 2021 · Updated 4 years ago
- The source code is for the paper: “Triple Sequence Learning for Cross-domain Recommendation” accepted in TOIS by Haokai Ma, Ruobing Xie, … ☆17 · Mar 1, 2024 · Updated 2 years ago
- code for our WWW 2019 paper: "Open-world Learning and Application to Product Classification" ☆37 · Apr 19, 2019 · Updated 7 years ago
- Learning Cross-modal Retrieval with Noisy Labels (CVPR 2021, PyTorch Code) ☆13 · Apr 7, 2021 · Updated 5 years ago
- clip retrieval benchmark ☆17 · May 4, 2022 · Updated 4 years ago
- Code for ACL 2019 paper "Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs" ☆18 · Feb 9, 2020 · Updated 6 years ago
- Easy to download and parse version of the Smartdoc 2015 - Challenge 1 dataset. ☆18 · Mar 5, 2018 · Updated 8 years ago
- 🌮 Table-based KB Completer ☆16 · Mar 13, 2024 · Updated 2 years ago
- The official repository for "Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP" published in ACL-IJNLP 2… ☆20 · Apr 22, 2022 · Updated 4 years ago
- The code for paper "Cross Platforms Linguals and Models Social Bot Detection via Federated Adversarial Contrastive Knowledge Distillation… ☆20 · May 3, 2023 · Updated 3 years ago
- This is the repository of code and dataset for paper "The Rise of Guardians: Fact-checking URL Recommendation to Combat Fake News", SIGIR… ☆17 · Feb 19, 2022 · Updated 4 years ago
- CrossWeigh: Training Named Entity Tagger from Imperfect Annotations ☆178 · Jul 25, 2024 · Updated last year
- ☆18 · Mar 3, 2023 · Updated 3 years ago
- WebRED is a large and diverse manually annotated dataset for extracting relationships from a variety of text found on the World Wide Web. ☆22 · Mar 11, 2021 · Updated 5 years ago
- ☆37 · Sep 22, 2021 · Updated 4 years ago
- Iterative Rank-Aware Open IE ☆30 · Jun 24, 2019 · Updated 6 years ago