IBM / ColBERT-practical
Code and scripts for the NAACL 2022 industry track paper "Fast and Light-weight Answer Text Retrieval in Dialogue Systems". Built on top of ColBERT (https://github.com/stanford-futuredata/ColBERT).
☆13 · Updated last year
Related projects:
- Zero-shot Document Ranking with Large Language Models. ☆88 · Updated 2 months ago
- An easy-to-use Python toolkit for flexibly adapting various neural ranking models to any target domain. ☆55 · Updated last year
- Implementation of paper: HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Reranking ☆65 · Updated last year
- TAT-QA (Tabular And Textual dataset for Question Answering) contains 16,552 questions associated with 2,757 hybrid contexts from real-wor… ☆88 · Updated last week
- 🦮 Code and pretrained models for Findings of ACL 2022 paper "LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrie… ☆49 · Updated 2 years ago
- An Open-Source Package for Information Retrieval ☆145 · Updated last month
- A simple example for finetuning HuggingFace T5 model. Includes code for intermediate generation. ☆27 · Updated 3 years ago
- Deep Keyphrase Generation with Pre-trained Language Models ☆23 · Updated 6 months ago
- Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines … ☆130 · Updated last year
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd… ☆39 · Updated 2 months ago
- SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling ☆58 · Updated 3 years ago
- Source code for the paper "WhiteningBERT: An Easy Unsupervised Sentence Embedding Approach". ☆56 · Updated 3 years ago
- Code and dataset for the EMNLP paper "Instruct and Extract: Instruction Tuning for On-Demand Information Extraction" ☆46 · Updated 8 months ago
- Leveraging passage embeddings for efficient listwise reranking with large language models. ☆27 · Updated 2 months ago
- Companion repo for "Evaluating Verifiability in Generative Search Engines". ☆77 · Updated last year
- This is the code for our KILT leaderboard submissions (KGI + Re2G models). ☆146 · Updated last year
- Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022 ☆88 · Updated 3 months ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)" ☆91 · Updated last year
- Code repo for ACL22 paper "DeepStruct: Pretraining of Language Models for Structure Prediction" ☆80 · Updated last year
- Build Text Rerankers with Deep Language Models ☆245 · Updated 7 months ago
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks ☆47 · Updated 5 months ago
- Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021 ☆55 · Updated 2 years ago
- EMNLP'2023 (Findings): Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples! ☆36 · Updated 5 months ago
- Use a business-level retrieval system (BM25) with Python in just a few lines. ☆31 · Updated last year