google-research/t5x_retrieval

☆102

Alternatives and similar repositories for t5x_retrieval

Users who are interested in t5x_retrieval are comparing it to the libraries listed below.

castorini / mr.tydi
Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.
☆83 · Feb 16, 2022 · Updated 4 years ago
zetaalphavector / InPars
Inquisitive Parrots for Search
☆200 · Jun 5, 2025 · Updated last year
OpenMatch / ANCE-Tele
Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…
☆18 · Mar 25, 2024 · Updated 2 years ago
google / flaxformer
☆371 · Apr 12, 2024 · Updated 2 years ago
aiintelligentsystems / next-level-bert
☆16 · Jun 14, 2024 · Updated 2 years ago
guilhermemr04 / scaling-zero-shot-retrieval
No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval
☆29 · Sep 26, 2022 · Updated 3 years ago
jingtaozhan / disentangled-retriever
An easy-to-use Python toolkit for flexibly adapting various neural ranking models to a target domain.
☆60 · May 17, 2023 · Updated 3 years ago
DevSinghSachan / emdr2
Code and Models for the paper "End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering" (NeurIPS 20…
☆110 · Apr 18, 2022 · Updated 4 years ago
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any Hugging Face text dataset.
☆98 · Feb 9, 2023 · Updated 3 years ago
VictorProkhorov / Text2Path
[NAACL(2019)] Generating Knowledge Graph Paths from Textual Definitions using Sequence-to-Sequence Models
☆11 · Apr 27, 2022 · Updated 4 years ago
OpenMatch / COCO-DR
[EMNLP 2022] This is the code repo for our EMNLP'22 paper "COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contr…
☆51 · Oct 12, 2023 · Updated 2 years ago
stanfordnlp / multi-distribution-retrieval
Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval
☆17 · Jan 16, 2024 · Updated 2 years ago
henryzhao5852 / BeamDR
☆15 · Oct 10, 2021 · Updated 4 years ago
project-miracl / miracl
A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.
☆211 · Jul 31, 2024 · Updated last year
beir-cellar / beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
☆2,237 · Oct 16, 2025 · Updated 9 months ago
microsoft / Efficient-Large-LM-Trainer
☆39 · Jul 25, 2024 · Updated last year
csarron / BTR
☆16 · Mar 3, 2024 · Updated 2 years ago
luyug / GradCache
Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint
☆443 · Mar 26, 2024 · Updated 2 years ago
google-deepmind / scaling_laws_for_routing
☆14 · Jul 21, 2022 · Updated 3 years ago
pytorch-tpu / transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
☆17 · Jun 5, 2025 · Updated last year
ddehun / DEnsity
Official repository for "DEnsity: Open-domain Dialogue Evaluation Metric using Density Estimation (ACL2023 Findings)"
☆11 · May 23, 2023 · Updated 3 years ago
allenai / decon
decontamination
☆35 · Mar 4, 2026 · Updated 4 months ago
joeljang / ELM
[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning
☆99 · Apr 26, 2023 · Updated 3 years ago
christophschuhmann / 4MC-4M-Image-Text-Pairs-with-CLIP-embeddings
I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & its caption derived f…
☆17 · Apr 22, 2021 · Updated 5 years ago
unicamp-dl / mMARCO
A multilingual version of MS MARCO passage ranking dataset
☆148 · Oct 19, 2023 · Updated 2 years ago
texttron / tevatron
Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.
☆742 · Jul 3, 2026 · Updated 2 weeks ago
facebookresearch / QA-Overlap
Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets"
☆66 · Aug 31, 2021 · Updated 4 years ago
google-research / t5x
☆2,975 · Jul 9, 2026 · Updated last week
mukhal / intrinsic-source-citation
[COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models
☆19 · Apr 1, 2025 · Updated last year
jingtaozhan / extrapolate-eval
CIKM 2022: Evaluating Interpolation and Extrapolation Performance of Neural Retrieval Models
☆10 · Aug 4, 2022 · Updated 3 years ago
catie-aq / flashT5
A fast implementation of T5/UL2 in PyTorch using Flash Attention
☆116 · Oct 30, 2025 · Updated 8 months ago
LAION-AI / laion50BU
Un-*** 50 billions multimodality dataset
☆24 · Sep 14, 2022 · Updated 3 years ago
AkariAsai / evidentiality_qa
The official implementation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).
☆44 · Dec 25, 2022 · Updated 3 years ago
chang-github-00 / LLM-Predictive-Decoding
☆16 · Jul 9, 2025 · Updated last year
studio-ousia / bpr
Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering
☆175 · Jun 6, 2021 · Updated 5 years ago
microsoft / AR2
☆70 · Jun 16, 2022 · Updated 4 years ago
facebookresearch / dpr-scale
Scalable training for dense retrieval models.
☆298 · Jul 2, 2026 · Updated 2 weeks ago
thunlp / ConvDR
Code repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval"
☆43 · Dec 9, 2021 · Updated 4 years ago
JetRunner / LaPraDoR
🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie…
☆49 · Apr 25, 2022 · Updated 4 years ago