castorini / pygaggle
a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini
☆350Updated last year
Alternatives and similar repositories for pygaggle:
Users that are interested in pygaggle are comparing it to the libraries listed below
- A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Re…☆323Updated last year
- docTTTTTquery document expansion model☆364Updated 2 years ago
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch☆262Updated 2 years ago
- Build Text Rerankers with Deep Language Models☆262Updated last year
- EMNLP 2021 - Pre-training architectures for dense retrieval☆251Updated 3 years ago
- Search Engines with Autoregressive Language models☆284Updated 2 years ago
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, …☆319Updated last year
- A simple toolkit to process TREC files in Python.☆167Updated 8 months ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆604Updated 2 years ago
- A multilingual version of MS MARCO passage ranking dataset☆144Updated last year
- pytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.☆306Updated last year
- Data and models for the SciFact verification task.☆229Updated last year
- NAACL2021 - COIL Contextualized Lexical Retriever☆152Updated 3 years ago
- Resources for the "Evaluating the Factual Consistency of Abstractive Text Summarization" paper☆296Updated last year
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆202Updated 3 years ago
- Scalable training for dense retrieval models.☆292Updated last month
- Fusion-in-Decoder☆566Updated last year
- ☆479Updated 3 years ago
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, …☆124Updated 3 years ago
- Provides a common interface to many IR ranking datasets.☆352Updated last week
- Dataset and code for EMNLP2020 paper "HybridQA: A Dataset of Multi-Hop Question Answeringover Tabular and Textual Data"☆226Updated last year
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆333Updated last year
- ☆345Updated 3 years ago
- A library to conduct ranking experiments with transformers.☆161Updated last year
- Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation☆110Updated 3 years ago
- A novel embedding training algorithm leveraging ANN search and achieved SOTA retrieval on Trec DL 2019 and OpenQA benchmarks☆371Updated last year
- A repo to explore different NLP tasks which can be solved using T5☆172Updated 4 years ago
- An Open-Source Package for Information Retrieval.☆448Updated 2 years ago
- ☆82Updated last year
- Multi-hop dense retrieval for question answering☆213Updated 3 years ago