kuutsav / information-retrieval
Neural information retrieval / Semantic search / Bi-encoders
☆168Updated last year
Alternatives and similar repositories for information-retrieval:
Users that are interested in information-retrieval are comparing it to the libraries listed below
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆329Updated last year
- Efficient Attention for Long Sequence Processing☆92Updated last year
- A multilingual version of MS MARCO passage ranking dataset☆143Updated last year
- Search Engines with Autoregressive Language models☆283Updated last year
- Inquisitive Parrots for Search☆188Updated last year
- docTTTTTquery document expansion model☆361Updated last year
- Provides a common interface to many IR ranking datasets.☆346Updated last week
- Few-shot Named Entity Recognition☆123Updated 2 years ago
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch☆262Updated 2 years ago
- Build Text Rerankers with Deep Language Models☆261Updated last year
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- Some notebooks for NLP☆196Updated last year
- A library to conduct ranking experiments with transformers.☆161Updated last year
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini☆348Updated last year
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆201Updated 2 years ago
- An assignment for CMU CS11-711 Advanced NLP, building NLP systems from scratch☆171Updated 2 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆82Updated 4 months ago
- A Python framework for performing information retrieval experiments, building on http://terrier.org/☆438Updated 2 weeks ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago
- EMNLP 2021 - Pre-training architectures for dense retrieval☆245Updated 3 years ago
- A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Re…☆322Updated last year
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆153Updated last year
- Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation☆109Updated 3 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- pytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.☆298Updated last year
- ☆84Updated 6 months ago
- Train Dense Passage Retriever (DPR) with a single GPU☆130Updated 3 years ago
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension, question answering, …☆315Updated last year
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training☆154Updated 2 years ago
- Scalable training for dense retrieval models.☆284Updated 3 weeks ago