PaddlePaddle / RocketQA
🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.
☆777 Updated last year
Alternatives and similar repositories for RocketQA:
Users who are interested in RocketQA are comparing it to the libraries listed below:
- Unified Structure Generation for Universal Information Extraction ☆929 Updated 2 years ago
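- 3,000,000+ semantic understanding and matching datasets. Can be used for unsupervised contrastive learning, semi-supervised learning, and similar approaches to build the best-performing pretrained models for the Chinese domain ☆294 Updated 2 years ago
- PromptCLUE, a zero-shot learning model supporting all-Chinese tasks ☆663 Updated last year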
- Mengzi Pretrained Models ☆536 Updated 2 years ago
- PyTorch implementation of the PaddleNLP UIE model ☆628 Updated last year
- a bert for retrieval and generation ☆857 Updated 4 years ago
- unified embedding model ☆854 Updated last year
- An upgraded version of SimBERT (SimBERTv2)! ☆441 Updated 3 years ago
- CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation ☆487 Updated 2 years ago
- Open Language Pre-trained Model Zoo ☆997 Updated 3 years ago
- TextGen: Implementation of Text Generation models, including LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. Text generation models, with implementations including LLaMA, ChatGLM, BLO… ☆962 Updated 7 months ago
- A sentence-embedding approach more effective than Sentence-BERT ☆371 Updated 2 years ago
- Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle ☆672 Updated last year
- Revisiting Pre-trained Models for Chinese Natural Language Processing (MacBERT) ☆670 Updated 2 years ago
- Collections of resources from Joint Laboratory of HIT and iFLYTEK Research (HFL) ☆367 Updated 2 years ago
- Simple vector whitening to improve sentence-embedding quality ☆484 Updated 3 years ago
- Large-scale Pre-training Corpus for Chinese 100G (Chinese pre-training corpus) ☆954 Updated 2 years ago
- Chinese generative pre-trained models ☆566 Updated 3 years ago
- [COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset ☆625 Updated last year
- Pre-trained Chinese ELECTRA ☆1,416 Updated 2 years ago
- pCLUE: 1,000,000+ multi-task prompt learning dataset ☆490 Updated 2 years ago
- The online version is temporarily unavailable because we cannot afford the key. You can clone and run it locally. Note: we set defaul ope… ☆815 Updated 10 months ago
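- FewCLUE: a few-shot learning evaluation benchmark, Chinese version ☆506 Updated 2 years ago
- MiniRBT (a series of small Chinese pre-trained models) ☆276 Updated 2 years ago
- Chinese natural language inference and semantic similarity datasets ☆349 Updated 3 years ago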
- [SIGIR 2022] Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval ☆184 Updated 2 years ago
- ☆415 Updated last year
- A framework for cleaning Chinese dialog data ☆269 Updated 3 years ago
- ☆440 Updated last month
- Simple experiments with SimCSE on Chinese tasks ☆605 Updated last year