[EMNLP 2022] This is the code repo for our EMNLP'22 paper "COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrastive and Distributionally Robust Learning".
☆50Oct 12, 2023Updated 2 years ago
Alternatives and similar repositories for COCO-DR
Users who are interested in COCO-DR are comparing it to the libraries listed below.
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…☆18Mar 25, 2024Updated last year
- Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder☆10Mar 16, 2023Updated 3 years ago
- [ML4H 2022] This is the code for our paper `Counterfactual and Factual Reasoning over Hypergraphs for Interpretable Clinical Predictions …☆26Feb 6, 2024Updated 2 years ago
- [AAAI 2023] This is the code for our paper `Neighborhood-Regularized Self-Training for Learning with Few Labels'.☆12Jan 11, 2023Updated 3 years ago
- This is the artifact for paper “Are Machine Learning Cloud APIs Used Correctly? (#421)” in ICSE2021☆16Feb 27, 2021Updated 5 years ago
- Scalable training for dense retrieval models.☆298Jun 10, 2025Updated 9 months ago
- SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling☆60Jul 11, 2021Updated 4 years ago
- This is the repository for paper `Learning Task-Aware Effective Brain Connectivity for fMRI Analysis with Graph Neural Networks'.☆14Nov 22, 2023Updated 2 years ago
- ☆24Oct 23, 2020Updated 5 years ago
- ☆35May 18, 2023Updated 2 years ago
- Findings of ACL'2023: Optimizing Test-Time Query Representations for Dense Retrieval☆30Oct 24, 2023Updated 2 years ago
- [ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation wi…☆41Jun 23, 2024Updated last year
- ☆54Jan 18, 2023Updated 3 years ago
- ☆12May 17, 2022Updated 3 years ago
- SIGIR'2022, Pre-train a Discriminative Text Encoder for Dense Retrieval via Contrastive Span Prediction☆27Nov 8, 2022Updated 3 years ago
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch☆265Jan 27, 2023Updated 3 years ago
- ☆70Jun 16, 2022Updated 3 years ago
- SGPT: GPT Sentence Embeddings for Semantic Search☆873Feb 17, 2024Updated 2 years ago
- Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning☆772Apr 7, 2023Updated 2 years ago
- Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations☆133Aug 6, 2025Updated 7 months ago
- This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classificati…☆16Jun 3, 2024Updated last year
- Extracting six domain-specific QA datasets from MS MARCO☆17Dec 1, 2019Updated 6 years ago
- Inquisitive Parrots for Search☆200Jun 5, 2025Updated 9 months ago
- [SIGIR24] Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval☆18Feb 29, 2024Updated 2 years ago
- ☆24Jun 28, 2023Updated 2 years ago
- CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup.☆52Feb 19, 2022Updated 4 years ago
- The GitHub repository of the paper "Understanding Differential Search Index for Text Retrieval" in ACL2023 Findings.☆16May 21, 2023Updated 2 years ago
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".☆224Dec 16, 2025Updated 3 months ago
- SPLADE: sparse neural search (SIGIR21, SIGIR22)☆983May 3, 2024Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆22Jun 30, 2025Updated 8 months ago
- [EMNLP 2022] Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning☆136Nov 17, 2023Updated 2 years ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning☆14Oct 27, 2021Updated 4 years ago
- EMNLP 2021 - Pre-training architectures for dense retrieval☆256Mar 18, 2022Updated 4 years ago
- ☆102Dec 17, 2022Updated 3 years ago
- RepBERT is a competitive first-stage retrieval technique. It represents documents and queries with fixed-length contextualized embeddings…☆66Oct 13, 2021Updated 4 years ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.☆734Jan 26, 2026Updated last month
- [ACL 2024] This is the code for our paper "RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records".☆41Sep 19, 2024Updated last year
- pytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.☆345Oct 10, 2023Updated 2 years ago
- Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs. EMNLP 2024☆27Nov 13, 2024Updated last year