xrr233 / WebformerLinks
SIGIR-2022 Webformer: Pre-training with Web Pages for Information Retrieval
☆50Updated 3 years ago
Alternatives and similar repositories for Webformer
Users that are interested in Webformer are comparing it to the libraries listed below
Sorting:
- Simplified DOM Trees for Transferable Attribute Extraction from the Web☆40Updated last year
- An Open-Source Package for Information Retrieval☆168Updated 3 weeks ago
- An easy-to-use python toolkit for flexibly adapting various neural ranking models to target domain.☆60Updated 2 years ago
- Implementation of paper: HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Reranking☆74Updated 3 years ago
- Build Text Rerankers with Deep Language Models☆264Updated last year
- Codebase for RetroMAE and beyond.☆272Updated last year
- This is the code for our KILT leaderboard submissions (KGI + Re2G models).☆157Updated 4 months ago
- A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses with the possibility to training/evaluating on…☆142Updated 2 years ago
- Data and Code for ICLR2020 Paper "TabFact: A Large-scale Dataset for Table-based Fact Verification"☆416Updated 2 years ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆100Updated 3 years ago
- The dataset contains 3 million attribute-value annotations across 1257 unique categories on 2.2 million cleaned Amazon product profiles. …☆151Updated 3 years ago
- Code repo for ACL22 paper "DeepStruct: Pretraining of Language Models for Structure Prediction"☆87Updated 3 years ago
- Scalable training for dense retrieval models.☆298Updated 7 months ago
- ☆57Updated 10 months ago
- TUTA and ForTaP for Structure-Aware and Numerical-Reasoning-Aware Table Pre-Training☆126Updated last month
- Finetune mistral-7b-instruct for sentence embeddings☆88Updated last year
- Dataset and code for EMNLP2020 paper "HybridQA: A Dataset of Multi-Hop Question Answeringover Tabular and Textual Data"☆244Updated 2 years ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆165Updated 2 years ago
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆165Updated 3 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆168Updated 2 years ago
- [ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.☆106Updated last month
- Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.☆112Updated 2 years ago
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions☆178Updated 2 years ago
- 🌳CED: Catalog Extraction from Documents☆16Updated 2 years ago
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆150Updated last year
- The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper☆70Updated 2 years ago
- A toolkit for building dense retrievers with deep language models.☆64Updated 4 years ago
- YuLan-IR: Information Retrieval Boosted LMs☆220Updated last year
- [ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM☆190Updated last year
- ICLR 2022 Paper, SOTA Table Pre-training Model, TAPEX: Table Pre-training via Learning a Neural SQL Executor☆299Updated 3 years ago