klarna / product-page-datasetLinks
☆52Updated 10 months ago
Alternatives and similar repositories for product-page-dataset
Users that are interested in product-page-dataset are comparing it to the libraries listed below
Sorting:
- Index of URLs to pdf files all over the internet and scripts☆23Updated 2 years ago
- Code for "Open Vocabulary Extreme Classification Using Generative Models"☆24Updated 2 years ago
- ☆24Updated last year
- The code related to the baselines from NeurIPS 2021 paper "DUE: End-to-End Document Understanding Benchmark."☆36Updated 2 years ago
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"☆55Updated 2 years ago
- [NAACL 2022] TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages☆20Updated 3 years ago
- A Context-aware Visual Attention-based training pipeline for Object Detection from a Webpage screenshot!☆93Updated 3 months ago
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆77Updated last year
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022).☆25Updated 2 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆33Updated last year
- [EMNLP 2021] The baseline code for WebSRC dataset.☆50Updated 2 months ago
- It includes two datasets that are used in the downstream tasks for evaluating UIBert: App Similar Element Retrieval data and Visual Item …☆44Updated 3 years ago
- ☆17Updated last month
- Finding semantically meaningful and accurate prompts.☆46Updated last year
- Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.☆14Updated 3 weeks ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆23Updated 2 months ago
- Retrieval as Attention☆82Updated 2 years ago
- ☆39Updated 3 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year
- Big-Interleaved-Dataset☆57Updated 2 years ago
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆32Updated 2 years ago
- ☆95Updated 2 years ago
- ☆23Updated last year
- ☆38Updated last year
- ☆64Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 3 years ago
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆29Updated 2 years ago
- One-stop shop for running and fine-tuning transformer-based language models for retrieval☆56Updated this week
- Command-line tool for downloading and extending the RedCaps dataset.☆47Updated last year