castorini / onboardingLinks
Onboarding guide to Jimmy Lin's research group at the University of Waterloo
☆34Updated last month
Alternatives and similar repositories for onboarding
Users that are interested in onboarding are comparing it to the libraries listed below
Sorting:
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini☆351Updated last year
- Search Engines with Autoregressive Language models☆291Updated 2 years ago
- Provides a common interface to many IR ranking datasets.☆372Updated 2 weeks ago
- Inquisitive Parrots for Search☆197Updated 3 months ago
- Scalable training for dense retrieval models.☆298Updated 3 months ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆172Updated 4 years ago
- pytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.☆331Updated last year
- Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation☆113Updated 4 years ago
- EMNLP 2021 - Pre-training architectures for dense retrieval☆253Updated 3 years ago
- Train Dense Passage Retriever (DPR) with a single GPU☆133Updated 4 years ago
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, …☆329Updated 2 years ago
- A Python framework for performing information retrieval experiments, building on http://terrier.org/☆471Updated this week
- Unified Learned Sparse Retrieval Framework☆66Updated last year
- A simple toolkit to process TREC files in Python.☆173Updated last year
- Dense hybrid representations for text retrieval☆63Updated 2 years ago
- docTTTTTquery document expansion model☆369Updated 2 years ago
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…☆279Updated 2 years ago
- A multilingual version of MS MARCO passage ranking dataset☆145Updated last year
- ☆323Updated 4 years ago
- Long Document Summarization Papers☆150Updated 2 years ago
- Using business-level retrieval system (BM25) with Python in just a few lines.☆31Updated 2 years ago
- Retrieval-Augmented Generation battle!☆58Updated last month
- GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings☆44Updated last year
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆79Updated 3 years ago
- An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions"☆119Updated 3 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆207Updated 4 years ago
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch☆264Updated 2 years ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"☆101Updated 2 years ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆337Updated 2 years ago
- [EMNLP 2022] This is the code repo for our EMNLP‘22 paper "COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contr…☆50Updated last year