No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval
☆29Sep 26, 2022Updated 3 years ago
Alternatives and similar repositories for scaling-zero-shot-retrieval
Users that are interested in scaling-zero-shot-retrieval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Inquisitive Parrots for Search☆199Jun 5, 2025Updated 10 months ago
- ☆21Sep 6, 2021Updated 4 years ago
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini☆353Dec 21, 2023Updated 2 years ago
- ☆54Jan 18, 2023Updated 3 years ago
- Code and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking".☆51Oct 10, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆13Aug 13, 2020Updated 5 years ago
- A multilingual version of MS MARCO passage ranking dataset☆147Oct 19, 2023Updated 2 years ago
- ☆15Dec 20, 2020Updated 5 years ago
- ☆27Jan 23, 2024Updated 2 years ago
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…☆18Mar 25, 2024Updated 2 years ago
- A remake of sftp written in Rust☆19Aug 29, 2022Updated 3 years ago
- Dense Passage Retrieval using tensorflow-keras on TPU☆17Jun 27, 2021Updated 4 years ago
- ☆12Apr 29, 2022Updated 3 years ago
- docTTTTTquery document expansion model☆374Mar 25, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆40May 13, 2023Updated 2 years ago
- Deep Learning Hard (DL-HARD) is a new annotated dataset extending TREC Deep Learning benchmark.☆38Jan 9, 2022Updated 4 years ago
- An automated solution for fact-checking using available claims and fake-news datasets to fine-tune state-of-the-art language models publi…☆12Aug 28, 2022Updated 3 years ago
- Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval☆26Aug 7, 2023Updated 2 years ago
- Official library of images for the SIGIR 2019 Open-Source IR Replicability Challenge (OSIRRC 2019)☆13Jul 7, 2019Updated 6 years ago
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.…☆24Sep 11, 2020Updated 5 years ago
- ☆102Dec 17, 2022Updated 3 years ago
- ☆15Aug 15, 2012Updated 13 years ago
- Nordlys: Toolkit for entity-oriented and semantic search☆31Mar 23, 2021Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆39Nov 27, 2025Updated 4 months ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆80Feb 16, 2022Updated 4 years ago
- SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling☆60Jul 11, 2021Updated 4 years ago
- A Python Interface to Reproducibility Measures of System-Oriented IR Experiments☆11Dec 2, 2025Updated 4 months ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆100Nov 27, 2022Updated 3 years ago
- Code of fine-tuning neural sparse models and training from scratch. #SIGIR2025☆25Mar 11, 2026Updated last month
- A library for creating complex experimental pipelines☆12Jul 25, 2022Updated 3 years ago
- Metadata browser of TREC☆10Mar 27, 2026Updated 3 weeks ago
- Code for the NLP4Prog workshop paper "Reading StackOverflow Encourages Cheating: Adding Question TextImproves Extractive Code Generation"☆21Aug 10, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Overview of IR/NLP papers covered in my team's reading group.☆10May 5, 2020Updated 5 years ago
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆32Jul 20, 2022Updated 3 years ago
- SILO Language Models code repository☆83Feb 23, 2024Updated 2 years ago
- Using questions to summarize large amounts of textual data.☆25Sep 23, 2020Updated 5 years ago
- Jig for the Open-Source IR Replicability Challenge (OSIRRC)☆13Dec 8, 2022Updated 3 years ago
- EMNLP 2021 - Pre-training architectures for dense retrieval☆256Mar 18, 2022Updated 4 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆138Aug 2, 2023Updated 2 years ago