ma787639046 / bowdprLinks

[SIGIR24] Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval

☆17

Alternatives and similar repositories for bowdpr

Users that are interested in bowdpr are comparing it to the libraries listed below

Sorting:

orionw / FollowIR
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
☆45Updated last year
castorini / perm-sc
Official codebase for permutation self-consistency.
☆18Updated last year
ielab / PromptReps
Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval
☆50Updated last month
kaistAI / InstructIR
IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…
☆32Updated last year
alon-albalak / FLAD
Few-shot Learning with Auxiliary Data
☆28Updated last year
lovodkin93 / attribute-first-then-generate
Repository for "Attribute First, then Generate: Locally-attributable Grounded Text Generation", ACL 2024
☆29Updated 6 months ago
castorini / LiT5
☆18Updated 11 months ago
oriram / spider
☆54Updated 2 years ago
OSU-NLP-Group / In-Context-Reranking
[ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"
☆25Updated 3 months ago
stanford-futuredata / Baleen
Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval (NeurIPS'21)
☆45Updated 3 years ago
HKUNLP / ProGen
[EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.
☆27Updated 2 years ago
ThomasScialom / T0_continual_learning
Adding new tasks to T0 without catastrophic forgetting
☆33Updated 2 years ago
icip-cas / SelfRetrieval
☆34Updated 8 months ago
ielab / Starbucks
Starbucks: Improved Training for 2D Matryoshka Embeddings
☆21Updated 2 weeks ago
OSU-NLP-Group / AttrScore
Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"
☆56Updated 2 years ago
jingtaozhan / extrapolate-eval
CIKM 2022: Evaluating Interpolation and Extrapolation Performance of Neural Retrieval Models
☆11Updated 2 years ago
alibaba / SimCSE-with-CARDS
Source code for SIGIR 2022 paper.
☆15Updated 3 years ago
liujch1998 / memo-trap
☆21Updated 2 years ago
neulab / retomaton
PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)
☆73Updated 3 years ago
yuzhaouoe / pretraining-data-packing
[ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training
☆22Updated 11 months ago
jzbjyb / ReAtt
Retrieval as Attention
☆83Updated 2 years ago
INK-USC / FiD-ICL
"FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)
☆15Updated last year
caskcsg / ir
Collections of IR Research
☆35Updated 2 months ago
soyoung97 / ListT5
official repository for ListT5
☆46Updated 5 months ago
swj0419 / kNN_prompt
TBC
☆27Updated 2 years ago
HansiZeng / scaling-retriever
[SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"
☆15Updated 3 months ago
tau-nlp / scrolls
The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".
☆70Updated last year
neulab / data-agora
[arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"
☆33Updated 7 months ago
csarron / BTR
☆16Updated last year
sebastian-hofstaetter / colberter
☆47Updated 3 years ago