JetRunner/LaPraDoR

🦮 Code and pretrained models for the Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrieval"

☆49

Alternatives and similar repositories for LaPraDoR

Users who are interested in LaPraDoR are comparing it to the libraries listed below.

thakur-nandan / sprint
SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.
☆48 · Jul 25, 2023 · Updated 2 years ago
UKPLab / gpl
Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …
☆343 · Jul 6, 2023 · Updated 3 years ago
nreimers / beir-sparta
Re-implementation of the SPARTA model
☆13 · Oct 1, 2021 · Updated 4 years ago
castorini / mr.tydi
Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.
☆83 · Feb 16, 2022 · Updated 4 years ago
samuelbroscheit / wikiextractor-wikimentions
A tool for extracting plain text and internal Wikipedia links from Wikipedia dumps
☆11 · Apr 18, 2019 · Updated 7 years ago
luyug / Condenser
EMNLP 2021 - Pre-training architectures for dense retrieval
☆256 · Mar 18, 2022 · Updated 4 years ago
sebastian-hofstaetter / colberter
☆47 · Mar 27, 2022 · Updated 4 years ago
studio-ousia / bpr
Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering
☆175 · Jun 6, 2021 · Updated 5 years ago
ielab / asyncval
A toolkit for asynchronously validating dense retriever checkpoints during training.
☆27 · Aug 10, 2023 · Updated 2 years ago
microsoft / BiDR
Repo for WWW 2022 paper: Progressively Optimized Bi-Granular Document Representation for Scalable Embedding Based Retrieval
☆16 · Mar 1, 2022 · Updated 4 years ago
zh-zheng / BERT-QE
Code and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking".
☆51 · Oct 10, 2021 · Updated 4 years ago
clips / mfaq
MFAQ: a Multilingual FAQ Dataset
☆18 · Sep 17, 2023 · Updated 2 years ago
facebookresearch / reconsider
ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…
☆50 · Apr 26, 2021 · Updated 5 years ago
Albert-Ma / COSTA
SIGIR'2022, Pre-train a Discriminative Text Encoder for Dense Retrieval via Contrastive Span Prediction
☆27 · Nov 8, 2022 · Updated 3 years ago
oriram / spider
☆55 · Jan 18, 2023 · Updated 3 years ago
starsuzi / DAR
☆19 · Sep 19, 2022 · Updated 3 years ago
unicamp-dl / mMARCO
A multilingual version of the MS MARCO passage ranking dataset
☆148 · Oct 19, 2023 · Updated 2 years ago
fresh-stack / freshstack
This repository helps you evaluate your models on the FreshStack benchmark!
☆34 · Dec 9, 2025 · Updated 7 months ago
beir-cellar / beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
☆2,246 · Oct 16, 2025 · Updated 9 months ago
chengzhipanpan / DCSR
Code for the paper "Sentence-aware Contrastive Learning for Open-Domain Passage Retrieval", accepted at the ACL 2022 main conference as a long paper
☆30 · Mar 12, 2022 · Updated 4 years ago
facebookresearch / dpr-scale
Scalable training for dense retrieval models.
☆298 · Jul 2, 2026 · Updated 2 weeks ago
tricktreat / piqn
Code for "Parallel Instance Query Network for Named Entity Recognition", accepted at ACL 2022.
☆76 · Oct 28, 2022 · Updated 3 years ago
texttron / tevatron
Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.
☆742 · Updated this week
DevSinghSachan / emdr2
Code and Models for the paper "End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering" (NeurIPS 20…
☆110 · Apr 18, 2022 · Updated 4 years ago
thakur-nandan / income
INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.
☆24 · Sep 24, 2023 · Updated 2 years ago
hallogameboy / QDS-Transformer
☆16 · Sep 28, 2020 · Updated 5 years ago
epfml / X2Static
X2Static embeddings
☆16 · Jul 15, 2021 · Updated 5 years ago
LuisaMaerz / KnowMAN
KnowMAN: Weakly Supervised Multinomial Adversarial Networks
☆12 · Nov 9, 2021 · Updated 4 years ago
thakur-nandan / beir-ColBERT
Evaluation of BEIR datasets using the ColBERT retrieval model
☆18 · Mar 4, 2022 · Updated 4 years ago
pedrada88 / crossembeddings-twitter
☆14 · May 15, 2020 · Updated 6 years ago
neulab / knn-transformers
PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…
☆287 · Oct 20, 2022 · Updated 3 years ago
naver / splade
SPLADE: sparse neural search (SIGIR21, SIGIR22)
☆999 · May 3, 2024 · Updated 2 years ago
Yibin-Lei / CSQE
Implementation of the EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"
☆13 · Mar 19, 2024 · Updated 2 years ago
Alibaba-NLP / MuVER
[EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations
☆31 · May 23, 2022 · Updated 4 years ago
choe-hyonsu-gabrielle / korean-amr-corpus
Korean Abstract Meaning Representation (AMR) Corpus
☆10 · Feb 27, 2022 · Updated 4 years ago
sebastian-hofstaetter / tas-balanced-dense-retrieval
SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling
☆60 · Jul 11, 2021 · Updated 5 years ago
UKPLab / incorporating-relevance
Code for Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking, EMNLP 2022, https://aclan…
☆14 · Mar 30, 2026 · Updated 3 months ago
microsoft / MoPQ
☆13 · Nov 26, 2021 · Updated 4 years ago
keyonvafa / sequential-rationales
Rationales for Sequential Predictions
☆39 · Mar 10, 2022 · Updated 4 years ago