Alibaba-NLP / Multi-CPRLinks
[SIGIR 2022] Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval
☆200Updated 2 years ago
Alternatives and similar repositories for Multi-CPR
Users that are interested in Multi-CPR are comparing it to the libraries listed below
Sorting:
- T2Ranking: A large-scale Chinese benchmark for passage ranking.☆161Updated 2 years ago
- text embedding☆147Updated 2 years ago
- 比Sentence-BERT更有效的句向量方案☆376Updated 3 years ago
- CoSENT、STS、SentenceBERT☆172Updated 10 months ago
- 3000000+语义理解与匹配数据集。可用于无监督对比学习、半监督学习等构建中文领域效果最好的预训练模型☆312Updated 3 years ago
- 收集了目前为止中文领域的MRC抽取式数据集☆122Updated last year
- experiments of some semantic matching models and comparison of experimental results.☆163Updated 2 months ago
- Mengzi Pretrained Models☆540Updated 3 years ago
- 中文数据集下SimCSE+ESimCSE的实现☆193Updated 3 years ago
- 飞桨可信AI☆189Updated 2 years ago
- A simple framework for building some basic NLP tasks☆59Updated 3 years ago
- 中文 Instruction tuning datasets☆141Updated last year
- 中文自然语言推理数据集(A large-scale Chinese Nature language inference and Semantic similarity calculation Dataset)☆436Updated 5 years ago
- 中文自然语言推理与语义相似度数据集☆367Updated 3 years ago
- sentence-transformers to onnx 让sbert模型推理效率更快☆168Updated 3 years ago
- NLP句子编码、句子embedding、语义相似度:BERT_avg、BERT_whitening、SBERT、SmiCSE☆179Updated 3 years ago
- CCL 2022 汉语学习者文本纠错评测☆142Updated 3 years ago
- Pattern-Exploiting Training在中文上的简单实验☆174Updated 5 years ago
- 句子匹配模型,包括无监督的SimCSE、ESimCSE、PromptBERT,和有监督的SBERT、CoSENT。☆100Updated 3 years ago
- SimCSE有监督与无监督实验复现☆152Updated last year
- A framework for cleaning Chinese dialog data☆274Updated 4 years ago
- SinglepassTextCluster, an TextCluster tools based on Singlepass cluster algorithm that use tfidf vector and doc2vec,which can be used for…☆65Updated 4 years ago
- 真 · “Deep Learning for Humans”☆142Updated 4 years ago
- SimBERT升级版(SimBERTv2)!☆445Updated 3 years ago
- SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding☆226Updated 2 years ago
- ☆48Updated 2 years ago
- ☆271Updated last year
- 中文bigbird预训练模型☆96Updated 3 years ago
- Source code for the paper "PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction" in ACL2021☆241Updated 3 years ago
- 中文机器阅读理解数据集☆109Updated 4 years ago