luyug/GradCache

Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint

☆443

Alternatives and similar repositories for GradCache

Users who are interested in GradCache are comparing it to the libraries listed below.

luyug / GC-DPR
Train Dense Passage Retriever (DPR) with a single GPU
☆136 · Jun 16, 2021 · Updated 5 years ago
luyug / Condenser
EMNLP 2021 - Pre-training architectures for dense retrieval
☆256 · Mar 18, 2022 · Updated 4 years ago
texttron / tevatron
Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.
☆742 · Updated this week
luyug / COIL
NAACL2021 - COIL Contextualized Lexical Retriever
☆158 · Jul 27, 2021 · Updated 4 years ago
HansiZeng / scaling-retriever
[SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"
☆22 · Mar 31, 2025 · Updated last year
facebookresearch / dpr-scale
Scalable training for dense retrieval models.
☆298 · Jul 2, 2026 · Updated 2 weeks ago
DAMO-NLP-SG / Inf-CLIP
[CVPR 2025 Highlight] The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for C…
☆287 · Jan 16, 2025 · Updated last year
hieudx149 / X-RetroMAE
Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder
☆10 · Mar 16, 2023 · Updated 3 years ago
luyug / Reranker
Build Text Rerankers with Deep Language Models
☆265 · Feb 20, 2024 · Updated 2 years ago
sebastian-hofstaetter / tas-balanced-dense-retrieval
SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling
☆60 · Jul 11, 2021 · Updated 5 years ago
microsoft / ANCE
A novel embedding training algorithm leveraging ANN search and achieved SOTA retrieval on Trec DL 2019 and OpenQA benchmarks
☆385 · Jan 6, 2026 · Updated 6 months ago
staoxiao / RetroMAE
Codebase for RetroMAE and beyond.
☆275 · Jun 7, 2024 · Updated 2 years ago
ContextualAI / gritlm
Generative Representational Instruction Tuning
☆697 · Jun 25, 2025 · Updated last year
ielab / asyncval
A toolkit for asynchronously validating dense retriever checkpoints during training.
☆27 · Aug 10, 2023 · Updated 2 years ago
jingtaozhan / DRhard
SIGIR'21: Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track.
☆127 · Feb 15, 2022 · Updated 4 years ago
OpenMatch / ANCE-Tele
Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…
☆18 · Mar 25, 2024 · Updated 2 years ago
google-research / t5x_retrieval
☆102 · Dec 17, 2022 · Updated 3 years ago
microsoft / AR2
☆70 · Jun 16, 2022 · Updated 4 years ago
beir-cellar / beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
☆2,243 · Oct 16, 2025 · Updated 9 months ago
Muennighoff / sgpt
SGPT: GPT Sentence Embeddings for Semantic Search
☆872 · Feb 17, 2024 · Updated 2 years ago
DevSinghSachan / emdr2
Code and Models for the paper "End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering" (NeurIPS 20…
☆110 · Apr 18, 2022 · Updated 4 years ago
castorini / pyserini
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
☆2,100 · Updated this week
ict-bigdatalab / awesome-pretrained-models-for-information-retrieval
A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).
☆677 · Jan 7, 2024 · Updated 2 years ago
facebookresearch / DPR
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
☆1,868 · Apr 6, 2023 · Updated 3 years ago
facebookresearch / contriever
Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
☆779 · Apr 7, 2023 · Updated 3 years ago
studio-ousia / bpr
Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering
☆175 · Jun 6, 2021 · Updated 5 years ago
sebastian-hofstaetter / matchmaker
Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch
☆265 · Jan 27, 2023 · Updated 3 years ago
OpenMatch / COCO-DR
[EMNLP 2022] This is the code repo for our EMNLP'22 paper "COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contr…
☆51 · Oct 12, 2023 · Updated 2 years ago
castorini / dhr
Dense hybrid representations for text retrieval
☆65 · Apr 3, 2023 · Updated 3 years ago
sebastian-hofstaetter / colberter
☆47 · Mar 27, 2022 · Updated 4 years ago
AkariAsai / XORQA
This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".
☆80 · Jun 3, 2021 · Updated 5 years ago
princeton-nlp / SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
☆3,655 · Oct 16, 2024 · Updated last year
facebookresearch / SLIP
Code release for SLIP Self-supervision meets Language-Image Pre-training
☆792 · Feb 9, 2023 · Updated 3 years ago
thongnt99 / learned-sparse-retrieval
Unified Learned Sparse Retrieval Framework
☆68 · May 13, 2024 · Updated 2 years ago
nomic-ai / contrastors
Train Models Contrastively in Pytorch
☆798 · Mar 26, 2025 · Updated last year
thunlp / OpenMatch
An Open-Source Package for Information Retrieval.
☆442 · Oct 7, 2022 · Updated 3 years ago
lucidrains / n-grammer-pytorch
Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch
☆81 · Dec 4, 2022 · Updated 3 years ago
AranKomat / Metroplex
☆21 · Mar 15, 2023 · Updated 3 years ago
tunib-ai / parallelformers
Parallelformers: An Efficient Model Parallelization Toolkit for Deployment
☆787 · Apr 24, 2023 · Updated 3 years ago