UKPLab / useb
Heterogenous, Task- and Domain-Specific Benchmark for Unsupervised Sentence Embeddings used in the TSDAE paper: https://arxiv.org/abs/2104.06979.
☆32Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for useb
- A library to conduct ranking experiments with transformers.☆161Updated last year
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆72Updated 2 years ago
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, …☆123Updated 2 years ago
- Dense hybrid representations for text retrieval☆62Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 2 years ago
- ☆67Updated 3 years ago
- ☆37Updated 2 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆42Updated last year
- ☆73Updated 3 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆168Updated 3 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated last year
- State of the art Semantic Sentence Embeddings☆98Updated 2 years ago
- ☆55Updated last year
- Code and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking".☆50Updated 3 years ago
- source code of bison☆26Updated 4 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆201Updated 2 years ago
- SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling☆58Updated 3 years ago
- Few-shot NLP benchmark for unified, rigorous eval☆91Updated 2 years ago
- An Interactive Tool for Scalable and Reproducible Error Analysis.☆105Updated 3 years ago
- Multilingual abstractive summarization dataset extracted from WikiHow.☆85Updated 3 years ago
- PyTorch-IE: State-of-the-art Information Extraction in PyTorch☆76Updated last week
- Shared code for training sentence embeddings with Flax / JAX☆27Updated 3 years ago
- A toolkit for end-to-end neural ad hoc retrieval☆95Updated 3 months ago
- A Python framework for conversational search☆40Updated 3 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆139Updated 2 years ago
- Evaluation tools shared across anserini, pyserini, and pygaggle☆31Updated this week
- A multilingual version of MS MARCO passage ranking dataset☆142Updated last year
- XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning☆98Updated 3 years ago
- ☆85Updated 2 years ago
- Dataset for NAACL 2021 paper: "DART: Open-Domain Structured Data Record to Text Generation"☆149Updated 2 years ago