thunlp / OpenMatch
An Open-Source Package for Information Retrieval.
☆448Updated last year
Related projects: ⓘ
- A novel embedding training algorithm leveraging ANN search and achieved SOTA retrieval on Trec DL 2019 and OpenQA benchmarks☆359Updated last year
- ☆475Updated 2 years ago
- A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Re…☆315Updated last year
- DeepCT and HDCT uses BERT to generate novel, context-aware bag-of-words term weights for documents and queries.☆312Updated 3 years ago
- EMNLP 2021 - Pre-training architectures for dense retrieval☆244Updated 2 years ago
- Data and Code for ICLR2020 Paper "TabFact: A Large-scale Dataset for Table-based Fact Verification"☆371Updated last year
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆605Updated 2 years ago
- ACL2020 Tutorial: Open-Domain Question Answering☆834Updated 3 years ago
- ☆442Updated last year
- Semantics-aware BERT for Language Understanding (AAAI 2020)☆285Updated last year
- docTTTTTquery document expansion model☆350Updated last year
- Code associated with the Don't Stop Pretraining ACL 2020 paper☆525Updated 2 years ago
- Code for using and evaluating SpanBERT.☆884Updated last year
- [ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723☆714Updated 2 years ago
- MixText: Linguistically-Informed Interpolation of Hidden Space for Semi-Supervised Text Classification☆349Updated 4 years ago
- Autoregressive Entity Retrieval☆756Updated last year
- Training & evaluation library for text-based neural re-ranking and dense retrieval models built with PyTorch☆259Updated last year
- Source code for transferable dialogue state generator (TRADE, Wu et al., 2019). https://arxiv.org/abs/1905.08743☆391Updated last year
- Code for CEDR: Contextualized Embeddings for Document Ranking, accepted at SIGIR 2019.☆156Updated 3 years ago
- a gaggle of deep neural architectures for text ranking and question answering, designed for Pyserini☆338Updated 8 months ago
- MS MARCO(Microsoft Machine Reading Comprehension) is a large scale dataset focused on machine reading comprehension and question answerin…☆208Updated last year
- Facilitating the design, comparison and sharing of deep text matching models.☆496Updated 4 months ago
- This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (K…☆168Updated last year
- ☆292Updated last year
- TensorFlow implementation of On the Sentence Embeddings from Pre-trained Language Models (EMNLP 2020)☆528Updated 3 years ago
- ☆165Updated last year
- Resources for the MRQA 2019 Shared Task☆290Updated 3 years ago
- ☆197Updated last year
- [EMNLP 2020] Text Classification Using Label Names Only: A Language Model Self-Training Approach☆294Updated 2 years ago
- Build Text Rerankers with Deep Language Models☆245Updated 7 months ago