xrr233 / WebformerLinks
SIGIR-2022 Webformer: Pre-training with Web Pages for Information Retrieval
☆48Updated 2 years ago
Alternatives and similar repositories for Webformer
Users that are interested in Webformer are comparing it to the libraries listed below
Sorting:
- Simplified DOM Trees for Transferable Attribute Extraction from the Web☆38Updated 10 months ago
- Implementation of paper: HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Reranking☆72Updated 2 years ago
- Build Text Rerankers with Deep Language Models☆262Updated last year
- An Open-Source Package for Information Retrieval☆164Updated this week
- An easy-to-use python toolkit for flexibly adapting various neural ranking models to target domain.☆59Updated 2 years ago
- Code repo for ACL22 paper "DeepStruct: Pretraining of Language Models for Structure Prediction"☆87Updated 2 years ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆101Updated 2 years ago
- The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch.☆104Updated 2 years ago
- This is the code for our KILT leaderboard submissions (KGI + Re2G models).☆157Updated 3 months ago
- The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Gen…☆63Updated 2 years ago
- PyTorch implementation and pre-trained models for ASP - Autoregressive Structured Prediction with Language Models, EMNLP 22. https://arxi…☆106Updated last year
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data"☆104Updated last year
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, …☆126Updated 3 years ago
- [NAACL 2022] TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages☆20Updated 3 years ago
- TUTA and ForTaP for Structure-Aware and Numerical-Reasoning-Aware Table Pre-Training☆119Updated 8 months ago
- A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses with the possibility to training/evaluating on…☆141Updated last year
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆157Updated 2 years ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆163Updated last year
- Code and dataset for the emnlp paper titled Instruct and Extract: Instruction Tuning for On-Demand Information Extraction☆53Updated last year
- ☆183Updated 2 years ago
- The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper☆69Updated 2 years ago
- The dataset contains 3 million attribute-value annotations across 1257 unique categories on 2.2 million cleaned Amazon product profiles. …☆145Updated 2 years ago
- The unified platform for data-related resources.☆134Updated 2 years ago
- Codebase for RetroMAE and beyond.☆264Updated last year
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆159Updated last year
- A extension of Transformers library to include T5ForSequenceClassification class.☆39Updated 2 years ago
- A multilingual version of MS MARCO passage ranking dataset☆144Updated last year
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆150Updated 2 weeks ago
- ☆68Updated 2 years ago
- [ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.☆91Updated 4 months ago