guilhermemr04 / scaling-zero-shot-retrievalView external linksLinks
No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval
☆29Sep 26, 2022Updated 3 years ago
Alternatives and similar repositories for scaling-zero-shot-retrieval
Users that are interested in scaling-zero-shot-retrieval are comparing it to the libraries listed below
Sorting:
- ☆11May 17, 2022Updated 3 years ago
- ☆54Jan 18, 2023Updated 3 years ago
- Inquisitive Parrots for Search☆199Jun 5, 2025Updated 8 months ago
- ☆13Aug 13, 2020Updated 5 years ago
- Code and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking".☆51Oct 10, 2021Updated 4 years ago
- ☆15Dec 20, 2020Updated 5 years ago
- Dense Passage Retrieval using tensorflow-keras on TPU☆17Jun 27, 2021Updated 4 years ago
- Open-source Large Language Models are Strong Zero-shot Query Likelihood Models for Document Ranking☆17Oct 26, 2023Updated 2 years ago
- Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models'☆17Mar 14, 2022Updated 3 years ago
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…☆18Mar 25, 2024Updated last year
- ☆21Sep 6, 2021Updated 4 years ago
- A remake of sftp written in Rust☆19Aug 29, 2022Updated 3 years ago
- Code for the NLP4Prog workshop paper "Reading StackOverflow Encourages Cheating: Adding Question TextImproves Extractive Code Generation"☆21Aug 10, 2021Updated 4 years ago
- "Enemy Spotted: In-game Gun Sound Dataset for Gunshot Classification and Localization", accepted at IEEE Conference on Games (GoG) 2022☆21Sep 6, 2024Updated last year
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.…☆24Sep 11, 2020Updated 5 years ago
- A multilingual version of MS MARCO passage ranking dataset☆147Oct 19, 2023Updated 2 years ago
- One-stop shop for running and fine-tuning transformer-based language models for retrieval☆63Feb 10, 2026Updated last week
- Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021)☆30Apr 27, 2022Updated 3 years ago
- Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval☆26Aug 7, 2023Updated 2 years ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆100Nov 27, 2022Updated 3 years ago
- docTTTTTquery document expansion model☆374Mar 25, 2023Updated 2 years ago
- SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling☆60Jul 11, 2021Updated 4 years ago
- Repository for our ICLR 2019 paper: Discovery of Natural Language Concepts in Individual Units of CNNs☆26Mar 9, 2019Updated 6 years ago
- Multi-stage passage ranking: monoBERT + duoBERT☆110Nov 23, 2020Updated 5 years ago
- Domain-specific BERT representation for Named Entity Recognition of lab protocol☆29Dec 25, 2020Updated 5 years ago
- The implementation of the papers on dual learning of natural language understanding and generation. (ACL2019,2020; Findings of EMNLP 2020…☆67Oct 13, 2020Updated 5 years ago
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Oct 23, 2023Updated 2 years ago
- Code and Data for SIGIR 2020 Paper "Few-Shot Generative Conversational Query Rewriting"☆66Jun 12, 2023Updated 2 years ago
- Deep Learning Hard (DL-HARD) is a new annotated dataset extending TREC Deep Learning benchmark.☆36Jan 9, 2022Updated 4 years ago
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆32Jul 20, 2022Updated 3 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆137Aug 2, 2023Updated 2 years ago
- Meta Representation Transformation for Low-resource Cross-lingual Learning☆41May 5, 2021Updated 4 years ago
- Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations☆133Aug 6, 2025Updated 6 months ago
- SILO Language Models code repository☆83Feb 23, 2024Updated last year
- Nordlys: Toolkit for entity-oriented and semantic search☆31Mar 23, 2021Updated 4 years ago
- Monorepo for Static Fuse Gatsby Themes☆31Jan 4, 2023Updated 3 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆80Feb 16, 2022Updated 4 years ago
- ☆89Apr 3, 2025Updated 10 months ago
- ☆10Jun 19, 2024Updated last year