grill-lab / DL-Hard
Deep Learning Hard (DL-HARD) is a new annotated dataset extending TREC Deep Learning benchmark.
☆36Updated 3 years ago
Alternatives and similar repositories for DL-Hard:
Users that are interested in DL-Hard are comparing it to the libraries listed below
- SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling☆58Updated 3 years ago
- ☆36Updated 2 years ago
- Resources for the shared task on conversational question answering SCAI-QReCC 2021☆27Updated 2 years ago
- ☆67Updated 3 years ago
- Fusion for TREC run files with popular fusion techniques☆21Updated 2 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆73Updated 2 years ago
- A Python framework for conversational search☆40Updated 3 years ago
- CODEC is a document and entity ranking dataset that focuses on complex essay-style topics.☆16Updated last month
- Tools for the TREC CAsT benchmark☆27Updated 2 years ago
- ☆45Updated 2 years ago
- A toolkit for asynchronously validating dense retriever checkpoints during training.☆27Updated last year
- Dense hybrid representations for text retrieval☆62Updated last year
- ☆23Updated last year
- NLQuAD: A Non-Factoid Long Question Answering Data Set. To be published at EACL2021☆14Updated 3 years ago
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆32Updated 2 years ago
- ☆46Updated 5 years ago
- This dataset contains human judgements about answer equivalence. The data is based on SQuAD (Stanford Question Answering Dataset), and co…☆22Updated 2 years ago
- Code repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval"☆41Updated 3 years ago
- This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".☆79Updated 3 years ago
- ☆54Updated 2 years ago
- Code for the ECIR'22 paper "Evaluating the Robustness of Retrieval Pipelines with Query Variation Generators"☆15Updated 2 years ago
- RepBERT is a competitive first-stage retrieval technique. It represents documents and queries with fixed-length contextualized embeddings…☆66Updated 3 years ago
- Code to support the paper "Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets"☆66Updated 3 years ago
- Code and dataset for the EMNLP 2021 Finding paper "Can NLI Models Verify QA Systems’ Predictions?"☆25Updated last year
- The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆43Updated 2 years ago
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie…☆49Updated 2 years ago
- Contrastive Fact Verification☆71Updated 2 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated last year
- Unified Learned Sparse Retrieval Framework☆63Updated 8 months ago
- Qulac: A dataset on asking Questions for Lack of Clarity in open-domain information-seeking conversations.☆73Updated 3 years ago