microsoft / msmarco
website for MS Marco
☆27Updated 2 months ago
Alternatives and similar repositories for msmarco:
Users that are interested in msmarco are comparing it to the libraries listed below
- Submission archive for the MS MARCO document ranking leaderboard☆28Updated last year
- Submission archive for the MS MARCO passage ranking leaderboard☆13Updated last year
- Code and dataset "ZEST" from "Learning from task descriptions", Weller et al, EMNLP 2020☆17Updated 3 years ago
- A Python framework for conversational search☆40Updated 3 years ago
- Generative Retrieval Transformer☆28Updated last year
- Dataset and code for three Web crawling-related papers from SIGIR-2019, NeurIPS-2019. and ICML-2020.☆39Updated last week
- ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data.☆35Updated 4 years ago
- ☆97Updated 2 years ago
- StAtutory Reasoning Assessment☆13Updated 2 years ago
- ☆16Updated last year
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆168Updated 3 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- Game code and data for Fool Me Twice: Entailment from Wikipedia Gamification https://arxiv.org/abs/2104.04725☆17Updated last month
- A framework for building semantic parsers (including neural module networks) with AllenNLP, built by the authors of AllenNLP☆107Updated 2 years ago
- A toolkit for end-to-end neural ad hoc retrieval☆96Updated 4 months ago
- ☆42Updated 5 years ago
- ☆45Updated 2 years ago
- Code and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking".☆50Updated 3 years ago
- Train transformer-based models.☆28Updated last month
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, …☆123Updated 3 years ago
- ☆16Updated 5 months ago
- Scalable Attentive Sentence-Pair Modeling via Distilled Sentence Embedding (AAAI 2020) - PyTorch Implementation☆31Updated last year
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆72Updated 2 years ago
- Semantically Structured Sentence Embeddings☆66Updated 3 months ago
- Fusion for TREC run files with popular fusion techniques☆22Updated 2 years ago
- Implementation of pQRNN in PyTorch☆46Updated 3 years ago
- Wikipedia based dataset to train relationship classifiers and fact extraction models☆25Updated 3 years ago
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)☆106Updated 2 years ago
- Code for paper "AnswerQuest: A System for Generating Question-Answer Items from Multi-Paragraph Documents"☆19Updated last year
- An easy to use framework for large-scale fact-checking and question answering☆69Updated last year