jingtaozhan/RepBERT-Index

RepBERT is a competitive first-stage retrieval technique. It represents documents and queries with fixed-length contextualized embeddings and uses their inner products as relevance scores. Its efficiency is comparable to bag-of-words methods.

☆66
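
The scoring idea in the description above can be illustrated with a minimal sketch. It assumes the embeddings have already been computed; the vectors, document IDs, and the helper names `inner_product` and `rank_documents` are hypothetical and are not taken from the RepBERT-Index code.

```python
from typing import Dict, List, Tuple


def inner_product(u: List[float], v: List[float]) -> float:
    """Relevance score: inner product of two fixed-length embeddings."""
    if len(u) != len(v):
        raise ValueError("embeddings must have the same length")
    return sum(a * b for a, b in zip(u, v))


def rank_documents(
    query_vec: List[float],
    doc_vecs: Dict[str, List[float]],
    k: int = 2,
) -> List[Tuple[str, float]]:
    """Score every document against the query and return the top-k pairs."""
    scored = [(doc_id, inner_product(query_vec, vec)) for doc_id, vec in doc_vecs.items()]
    # Higher inner product means higher estimated relevance.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Toy 3-dimensional embeddings (hypothetical values, for illustration only).
doc_vecs = {
    "d1": [0.9, 0.1, 0.0],
    "d2": [0.1, 0.8, 0.1],
    "d3": [0.5, 0.5, 0.0],
}
query_vec = [1.0, 0.0, 0.0]

top = rank_documents(query_vec, doc_vecs, k=2)
for doc_id, score in top:
    print(doc_id, score)
```

This sketch scores every document exhaustively. Because document embeddings do not depend on the query, they can be computed once offline, so only the query needs to be encoded at search time.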

Alternatives and similar repositories for RepBERT-Index

Users that are interested in RepBERT-Index are comparing it to the libraries listed below.

jingtaozhan / DRhard
SIGIR'21: Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track.
☆127 · Feb 15, 2022 · Updated 4 years ago

jingtaozhan / bert-ranking-analysis
SIGIR'20: An Analysis of BERT in Document Ranking
☆21 · Jul 27, 2020 · Updated 6 years ago

Albert-Ma / PROP
WSDM'2021, PROP and SIGIR'2021, B-PROP
☆110 · May 18, 2023 · Updated 3 years ago

jingtaozhan / RepCONC
WSDM'22 Best Paper: Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval
☆119 · Aug 7, 2024 · Updated last year

jingtaozhan / extrapolate-eval
CIKM 2022: Evaluating Interpolation and Extrapolation Performance of Neural Retrieval Models
☆10 · Aug 4, 2022 · Updated 3 years ago

jingtaozhan / disentangled-retriever
An easy-to-use Python toolkit for flexibly adapting various neural ranking models to a target domain.
☆60 · May 17, 2023 · Updated 3 years ago

xuanyuan14 / ARES
SIGIR'22 paper: Axiomatically Regularized Pre-training for Ad hoc Search
☆23 · May 24, 2023 · Updated 3 years ago

jingtaozhan / JPQ
CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup.
☆52 · Feb 19, 2022 · Updated 4 years ago

zhengyima / Anchors
Source code of CIKM2021 Paper 'Pre-training for Ad-hoc Retrieval: Hyperlink is Also You Need'
☆16 · Aug 30, 2021 · Updated 4 years ago

oneal2000 / Wikiformer
Code for AAAI 2024 paper Wikiformer
☆20 · Dec 21, 2023 · Updated 2 years ago

nyu-dl / dl4marco-bert
☆485 · Jan 26, 2022 · Updated 4 years ago

thunlp / OpenMatch
An Open-Source Package for Information Retrieval.
☆442 · Oct 7, 2022 · Updated 3 years ago

microsoft / ANCE
A novel embedding training algorithm that leverages ANN search and achieves SOTA retrieval on TREC DL 2019 and OpenQA benchmarks
☆386 · Jan 6, 2026 · Updated 6 months ago

luyug / Condenser
EMNLP 2021 - Pre-training architectures for dense retrieval
☆256 · Mar 18, 2022 · Updated 4 years ago

HansiZeng / CL-DRD
[SIGIR 2022] The official repo for the paper "Curriculum Learning for Dense Retrieval Distillation".
☆23 · Apr 29, 2022 · Updated 4 years ago

CSHaitao / JTR
The official repo for our SIGIR'23 Full paper: Constructing Tree-based Index for Efficient and Effective Dense Retrieval
☆28 · Jun 7, 2023 · Updated 3 years ago

jordane95 / dual-cross-encoder
Dual Cross Encoder for Dense Retrieval
☆18 · Mar 15, 2023 · Updated 3 years ago

luyug / Reranker
Build Text Rerankers with Deep Language Models
☆265 · Feb 20, 2024 · Updated 2 years ago

Albert-Ma / COSTA
SIGIR'2022, Pre-train a Discriminative Text Encoder for Dense Retrieval via Contrastive Span Prediction
☆27 · Nov 8, 2022 · Updated 3 years ago

sebastian-hofstaetter / tas-balanced-dense-retrieval
SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling
☆60 · Jul 11, 2021 · Updated 5 years ago

Guzpenha / transformers_cl
Code for the paper "Curriculum Learning Strategies for IR: An Empirical Study on Conversation Response Ranking" at ECIR'20
☆17 · Dec 8, 2022 · Updated 3 years ago

sebastian-hofstaetter / matchmaker
Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch
☆265 · Jan 27, 2023 · Updated 3 years ago

nyu-dl / dl4ir-doc2query
☆162 · May 10, 2020 · Updated 6 years ago

Georgetown-IR-Lab / cedr
Code for CEDR: Contextualized Embeddings for Document Ranking, accepted at SIGIR 2019.
☆156 · Nov 6, 2020 · Updated 5 years ago

sebastian-hofstaetter / sigir19-neural-ir
Source code for: On the Effect of Low-Frequency Terms on Neural-IR Models, SIGIR'19
☆48 · Apr 30, 2019 · Updated 7 years ago

thunlp / DANCE
☆16 · Aug 2, 2021 · Updated 4 years ago

microsoft / AR2
☆71 · Jun 16, 2022 · Updated 4 years ago

HansiZeng / scaling-retriever
[SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"
☆22 · Mar 31, 2025 · Updated last year

iai-group / table-retrieval
☆11 · Jan 3, 2023 · Updated 3 years ago

luyug / COIL
NAACL2021 - COIL Contextualized Lexical Retriever
☆158 · Jul 27, 2021 · Updated 5 years ago

microsoft / BiDR
Repo for WWW 2022 paper: Progressively Optimized Bi-Granular Document Representation for Scalable Embedding Based Retrieval
☆16 · Mar 1, 2022 · Updated 4 years ago

canjiali / PARADE
Code and data to facilitate BERT/ELECTRA for document ranking. Details refer to the paper - PARADE: Passage Representation Aggregation for…
☆96 · Mar 25, 2023 · Updated 3 years ago

chauff / conversationalIR
Overview of venues, research themes and datasets relevant for conversational search.
☆146 · Aug 9, 2022 · Updated 3 years ago

oaqa / FlexNeuART
Flexible classic and NeurAl Retrieval Toolkit
☆224 · Jun 28, 2025 · Updated last year

caiyinqiong / Semantic-Retrieval-Models
A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Re…
☆341 · Jun 17, 2023 · Updated 3 years ago

jingtaozhan / IntelligenceTest
An evaluation framework to test AI in a trial-and-error process. It is a simplified Natural Selection test.
☆22 · Mar 11, 2025 · Updated last year

AdeDZY / DeepCT
DeepCT and HDCT use BERT to generate novel, context-aware bag-of-words term weights for documents and queries.
☆325 · May 9, 2021 · Updated 5 years ago

microsoft / MSMARCO-Document-Ranking
MS MARCO (Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, …
☆132 · Jan 3, 2022 · Updated 4 years ago

Georgetown-IR-Lab / OpenNIR
An end-to-end neural ad-hoc ranking pipeline.
☆153 · Jul 13, 2025 · Updated last year