PaddlePaddle/RocketQA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/PaddlePaddle/RocketQA)

PaddlePaddle / RocketQA

🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.

☆784

Alternatives and similar repositories for RocketQA

Users that are interested in RocketQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

PaddlePaddle / PaddleNLP
View on GitHub
Easy-to-use and powerful LLM and SLM library with awesome model zoo.
☆12,961May 23, 2026Updated 2 months ago
baidu / DuReader
View on GitHub
Baseline Systems of DuReader Dataset
☆1,178May 26, 2022Updated 4 years ago
facebookresearch / DPR
View on GitHub
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
☆1,869Apr 6, 2023Updated 3 years ago
princeton-nlp / SimCSE
View on GitHub
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
☆3,655Oct 16, 2024Updated last year
Alibaba-NLP / Multi-CPR
View on GitHub
[SIGIR 2022] Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval
☆204Jan 4, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
texttron / tevatron
View on GitHub
Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.
☆742Updated this week
luyug / Condenser
View on GitHub
EMNLP 2021 - Pre-training architectures for dense retrieval
☆256Mar 18, 2022Updated 4 years ago
RUCAIBox / DenseRetrieval
View on GitHub
☆220Dec 7, 2022Updated 3 years ago
PaddlePaddle / ERNIE
View on GitHub
The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.
☆7,724Jan 4, 2026Updated 6 months ago
sebastian-hofstaetter / tas-balanced-dense-retrieval
View on GitHub
SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling
☆60Jul 11, 2021Updated 5 years ago
castorini / pyserini
View on GitHub
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
☆2,102Jul 16, 2026Updated last week
beir-cellar / beir
View on GitHub
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
☆2,252Oct 16, 2025Updated 9 months ago
BDBC-KG-NLP / QA-Survey-CN
View on GitHub
北京航空航天大学大数据高精尖中心自然语言处理研究团队开展了智能问答的研究与应用总结。包括基于知识图谱的问答（KBQA），基于文本的问答系统（TextQA），基于表格的问答系统（TableQA）、基于视觉的问答系统（VisualQA）和机器阅读理解（MRC）等，每类任务分别对…
☆1,816Apr 6, 2023Updated 3 years ago
THUIR / T2Ranking
View on GitHub
T2Ranking: A large-scale Chinese benchmark for passage ranking.
☆161Jul 3, 2023Updated 3 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
jingtaozhan / DRhard
View on GitHub
SIGIR'21: Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track.
☆127Feb 15, 2022Updated 4 years ago
microsoft / AR2
View on GitHub
☆71Jun 16, 2022Updated 4 years ago
LianjiaTech / BELLE
View on GitHub
BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）
☆8,277Oct 16, 2024Updated last year
PaddlePaddle / Knover
View on GitHub
Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle
☆669Mar 6, 2024Updated 2 years ago
PaddlePaddle / Research
View on GitHub
novel deep learning research works with PaddlePaddle
☆1,757Aug 16, 2024Updated last year
luhua-rain / MRC_Competition_Dureader
View on GitHub
机器阅读理解冠军/亚军代码及中文预训练MRC模型
☆743Nov 19, 2022Updated 3 years ago
AlibabaResearch / HLATR
View on GitHub
Implementation of paper: HLATR: Enhance Multi-stage Text Retrieval with Hybrid List Aware Transformer Reranking
☆74Jan 4, 2023Updated 3 years ago
thunlp / OpenMatch
View on GitHub
An Open-Source Package for Information Retrieval.
☆442Oct 7, 2022Updated 3 years ago
shibing624 / pycorrector
View on GitHub
pycorrector is a toolkit for text error correction. 文本纠错，实现了Kenlm，T5，MacBERT，ChatGLM3，Qwen2.5等模型应用在纠错场景，开箱即用。
☆6,495Jun 4, 2026Updated last month
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
CLUEbenchmark / CLUE
View on GitHub
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
☆4,273Feb 6, 2026Updated 5 months ago
huawei-noah / Pretrained-Language-Model
View on GitHub
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
☆3,162Jan 22, 2024Updated 2 years ago
luyug / Reranker
View on GitHub
Build Text Rerankers with Deep Language Models
☆265Feb 20, 2024Updated 2 years ago
ymcui / Chinese-BERT-wwm
View on GitHub
Pre-Training with Whole Word Masking for Chinese BERT（中文BERT-wwm系列模型）
☆10,224Apr 19, 2026Updated 3 months ago
dbiir / UER-py
View on GitHub
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
☆3,110May 9, 2024Updated 2 years ago
FlagOpen / FlagEmbedding
View on GitHub
Retrieval and Retrieval-augmented LLMs
☆11,979Apr 22, 2026Updated 3 months ago
facebookresearch / contriever
View on GitHub
Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
☆780Apr 7, 2023Updated 3 years ago
luyug / COIL
View on GitHub
NAACL2021 - COIL Contextualized Lexical Retriever
☆158Jul 27, 2021Updated 4 years ago
thu-coai / CDial-GPT
View on GitHub
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
☆1,956Jun 12, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
princeton-nlp / DensePhrases
View on GitHub
[ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…
☆607Jun 15, 2022Updated 4 years ago
ielab / TILDE
View on GitHub
☆39Nov 21, 2022Updated 3 years ago
LeeSureman / Flat-Lattice-Transformer
View on GitHub
code for ACL 2020 paper: FLAT: Chinese NER Using Flat-Lattice Transformer
☆1,003May 10, 2022Updated 4 years ago
universal-ie / UIE
View on GitHub
Unified Structure Generation for Universal Information Extraction
☆958Jul 30, 2022Updated 3 years ago
ZhuiyiTechnology / pretrained-models
View on GitHub
Open Language Pre-trained Model Zoo
☆1,003Nov 18, 2021Updated 4 years ago
PaddlePaddle / Paddle-bot
View on GitHub
☆17Jan 22, 2025Updated last year
baichuan-inc / Baichuan-7B
View on GitHub
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
☆5,650Jul 18, 2024Updated 2 years ago