UKPLab / useb
Heterogeneous, Task- and Domain-Specific Benchmark for Unsupervised Sentence Embeddings used in the TSDAE paper: https://arxiv.org/abs/2104.06979.
☆32Updated 3 years ago
Alternatives and similar repositories for useb:
Users who are interested in useb are comparing it to the libraries listed below:
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆74Updated 3 years ago
- A library to conduct ranking experiments with transformers.☆161Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆169Updated 3 years ago
- SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling☆59Updated 3 years ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆44Updated last year
- Dense hybrid representations for text retrieval☆62Updated last year
- Anserini notebooks☆69Updated last year
- Code and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking".☆50Updated 3 years ago
- Use a business-level retrieval system (BM25) with Python in just a few lines.☆31Updated 2 years ago
- A toolkit for end-to-end neural ad hoc retrieval☆95Updated 7 months ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data; it should also work with any Hugging Face text dataset.☆93Updated 2 years ago
- source code of bison☆26Updated 4 years ago
- Shared code for training sentence embeddings with Flax / JAX☆27Updated 3 years ago
- A multilingual version of MS MARCO passage ranking dataset☆143Updated last year
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆29Updated 2 years ago
- State-of-the-art Semantic Sentence Embeddings☆99Updated 2 years ago
- Open source library for few shot NLP☆77Updated last year
- MS MARCO (Microsoft Machine Reading Comprehension) is a large-scale dataset focused on machine reading comprehension, question answering, …☆123Updated 3 years ago