lemurproject / ClueWeb22Links
☆16Updated last year
Alternatives and similar repositories for ClueWeb22
Users that are interested in ClueWeb22 are comparing it to the libraries listed below
Sorting:
- Unified Learned Sparse Retrieval Framework☆67Updated last year
- ☆17Updated 3 years ago
- A toolkit for building dense retrievers with deep language models.☆64Updated 4 years ago
- ☆39Updated 2 years ago
- A Python framework for conversational search☆40Updated 4 years ago
- Query-focused summarization data☆42Updated 2 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆79Updated 3 years ago
- ☆46Updated 3 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆42Updated 2 years ago
- Code repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval"☆42Updated 4 years ago
- A toolkit for asynchronously validating dense retriever checkpoints during training.☆27Updated 2 years ago
- Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation".☆82Updated this week
- Companion repo for "Evaluating Verifiability in Generative Search Engines".☆86Updated 2 years ago
- ☆49Updated 2 years ago
- The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".☆69Updated last year
- ☆50Updated 2 years ago
- ☆144Updated 11 months ago
- ☆97Updated 3 years ago
- This project studies the performance and robustness of language models and task-adaptation methods.☆155Updated last year
- EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers https://arxiv.org/abs/2109.08535☆146Updated 3 years ago
- A framework for few-shot evaluation of autoregressive language models.☆105Updated 2 years ago
- A unified benchmark for math reasoning☆89Updated 2 years ago
- An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions"☆120Updated 3 years ago
- ☆54Updated 2 years ago
- Dense hybrid representations for text retrieval☆63Updated 2 years ago
- ☆55Updated 2 years ago
- Code and data associated with the AmbiEnt dataset in "We're Afraid Language Models Aren't Modeling Ambiguity" (Liu et al., 2023)☆64Updated last year
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"☆78Updated 2 years ago
- Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation"☆156Updated 3 years ago
- ☆101Updated 3 years ago