microsoft / msmarco
website for MS Marco
☆28Updated last week
Alternatives and similar repositories for msmarco:
Users that are interested in msmarco are comparing it to the libraries listed below
- Submission archive for the MS MARCO document ranking leaderboard☆29Updated last year
- A toolkit for end-to-end neural ad hoc retrieval☆95Updated 7 months ago
- ☆42Updated 5 years ago
- Generative Retrieval Transformer☆28Updated last year
- Code and dataset "ZEST" from "Learning from task descriptions", Weller et al, EMNLP 2020☆17Updated 4 years ago
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, …☆123Updated 3 years ago
- Viewer for the 🤗 datasets library.☆84Updated 3 years ago
- Question-answers, collected from Google☆128Updated 3 years ago
- ☆97Updated 2 years ago
- ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data.☆35Updated 4 years ago
- A Python framework for conversational search☆40Updated 3 years ago
- Anserini notebooks☆69Updated 2 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆202Updated 3 years ago
- This is the code for loading the SenseBERT model, described in our paper from ACL 2020.☆44Updated 2 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆154Updated last year
- Dataset and code for three Web crawling-related papers from SIGIR-2019, NeurIPS-2019. and ICML-2020.☆39Updated 2 months ago
- ☆75Updated 3 years ago
- A multi-stage neural search engine for the COVID-19 Open Research Dataset☆137Updated 2 years ago
- ☆84Updated 7 months ago
- source code of bison☆26Updated 4 years ago
- State of the art Semantic Sentence Embeddings☆99Updated 2 years ago
- Scalable Attentive Sentence-Pair Modeling via Distilled Sentence Embedding (AAAI 2020) - PyTorch Implementation☆31Updated last year
- A framework for building semantic parsers (including neural module networks) with AllenNLP, built by the authors of AllenNLP☆108Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- Multi-stage passage ranking: monoBERT + duoBERT☆112Updated 4 years ago
- ☆32Updated 3 years ago
- An open source toolkit for multimodal generative conversational task assistants, helping assist people with real-world complex tasks☆34Updated 10 months ago
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension and question answerin…☆215Updated last year
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆74Updated 3 years ago
- LM Pretraining with PyTorch/TPU☆134Updated 5 years ago