sebastian-hofstaetter / colberter
☆45Updated 2 years ago
Alternatives and similar repositories for colberter:
Users that are interested in colberter are comparing it to the libraries listed below
- Dense hybrid representations for text retrieval☆62Updated last year
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆44Updated last year
- ☆84Updated 6 months ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆22Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 3 years ago
- One-stop shop for running and fine-tuning transformer-based language models for retrieval☆47Updated this week
- ☆36Updated 2 years ago
- ☆55Updated 2 years ago
- SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling☆59Updated 3 years ago
- A toolkit for asynchronously validating dense retriever checkpoints during training.☆27Updated last year
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆74Updated 3 years ago
- Unified Learned Sparse Retrieval Framework☆64Updated 9 months ago
- ☆36Updated last week
- ☆29Updated last year
- ☆38Updated 2 months ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆18Updated 3 weeks ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆46Updated last year
- ☆96Updated 2 years ago
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie…☆49Updated 2 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆168Updated 3 years ago
- Cross language information retrieval pipeline☆18Updated last year
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an …☆32Updated last year
- Shared code for training sentence embeddings with Flax / JAX☆27Updated 3 years ago
- ☆15Updated 2 months ago
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆28Updated 2 years ago
- An easy-to-use python toolkit for flexibly adapting various neural ranking models to any target domain.☆59Updated last year
- A toolkit for building dense retrievers with deep language models.☆57Updated 3 years ago
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval☆40Updated 4 months ago
- provides a common interface to many IR measure tools☆81Updated last week
- A multilingual version of MS MARCO passage ranking dataset☆143Updated last year