castorini / onboarding
Onboarding guide to Jimmy Lin's research group at the University of Waterloo
☆25Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for onboarding
- ☆55Updated last year
- ☆315Updated 3 years ago
- ☆45Updated 2 years ago
- ☆29Updated 9 months ago
- WSDM'22 Best Paper: Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval☆118Updated 3 months ago
- Experiments on including metadata such as URLs, timestamps, website descriptions and HTML tags during pretraining.☆30Updated last year
- Dense hybrid representations for text retrieval☆61Updated last year
- Pytorch implementation of paper "Efficient Nearest Neighbor Language Models" (EMNLP 2021)☆71Updated 2 years ago
- Inquisitive Parrots for Search☆177Updated 8 months ago
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, …☆122Updated 2 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆44Updated 11 months ago
- Train Dense Passage Retriever (DPR) with a single GPU☆128Updated 3 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.