sebastian-hofstaetter / tas-balanced-dense-retrieval
SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling
โ58Updated 3 years ago
Related projects: โ
- โ36Updated last year
- ๐ฆฎ Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrieโฆโ49Updated 2 years ago
- RepBERT is a competitive first-stage retrieval technique. It represents documents and queries with fixed-length contextualized embeddingsโฆโ65Updated 2 years ago
- Dense hybrid representations for text retrievalโ60Updated last year
- โ17Updated 3 years ago
- โ55Updated last year
- Tools for the TREC CAsT benchmarkโ26Updated last year
- Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillationโ108Updated 3 years ago
- A Python framework for conversational searchโ40Updated 2 years ago
- Unified Learned Sparse Retrieval Frameworkโ57Updated 4 months ago
- A toolkit for asynchronously validating dense retriever checkpoints during training.โ27Updated last year
- โ23Updated last year
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answeringโ166Updated 3 years ago
- Resources for the shared task on conversational question answering SCAI-QReCC 2021โ27Updated 2 years ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"โ91Updated last year
- Code repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval"โ41Updated 2 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.โ71Updated 2 years ago
- Deep Learning Hard (DL-HARD) is a new annotated dataset extending TREC Deep Learning benchmark.โ35Updated 2 years ago
- Source code of CIKM2021 Paper 'Pre-training for Ad-hoc Retrieval: Hyperlink is Also You Need'โ17Updated 3 years ago
- CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup.โ49Updated 2 years ago
- โ33Updated last year
- Companion repo for "Evaluating Verifiability in Generative Search Engines".โ77Updated last year
- MIMICS: A Large-Scale Data Collection for Search Clarificationโ74Updated 4 years ago
- [SIGIR 2022] The official repo for the paper "Curriculum Learning for Dense Retrieval Distillation".โ23Updated 2 years ago
- โ24Updated 3 years ago
- โ33Updated 3 weeks ago
- A toolkit for end-to-end neural ad hoc retrievalโ95Updated 3 weeks ago
- โ45Updated 2 years ago
- A library for open domain query facet extraction and generationโ14Updated 4 months ago
- โ54Updated 2 years ago