中文文本语义相似度(Chinese Semantic Text Similarity)语料库建设
☆482Mar 7, 2018Updated 7 years ago
Alternatives and similar repositories for ChineseSTS
Users that are interested in ChineseSTS are comparing it to the libraries listed below
Sorting:
- Open Language Pre-trained Model Zoo☆1,005Nov 18, 2021Updated 4 years ago
- a bert for retrieval and generation☆860Feb 26, 2021Updated 5 years ago
- 文本匹配的相关模型DSSM,ESIM,ABCNN,BIMPM等,数据集为LCQMC官方数据☆470May 8, 2022Updated 3 years ago
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型☆3,984Nov 21, 2022Updated 3 years ago
- 问题句子相似度计算,即给定客服里用户描述的两句话,用算法来判断是否表示了相同的语义。☆370Nov 17, 2020Updated 5 years ago
- 一行代码使用BERT生成句向量,BERT做文本分类、文本相似度计算☆1,671Oct 14, 2019Updated 6 years ago
- 大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP☆9,862Feb 6, 2026Updated 3 weeks ago
- Facilitating the design, comparison and sharing of deep text matching models.☆3,855Aug 2, 2024Updated last year
- 自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似度(Sentence Similarity),XLNET句向量-相似度(text xlnet embedding),文本分类(Text classification), 实体提取(ner,b…☆1,539Sep 23, 2021Updated 4 years ago
- 100+ Chinese Word Vectors 上百种预训练中文词向量☆12,183Oct 30, 2023Updated 2 years ago
- 基于siamese-lstm的中文句子相似度计算☆129Jul 1, 2018Updated 7 years ago
- 综合了同义词词林扩展版与知网(Hownet)的词语相似度计算方法,词汇覆盖更多、结果更准确。☆744Feb 16, 2022Updated 4 years ago
- 利用预训练的中文模型实现基于bert的语义匹配模型 数据集为LCQMC官方数据☆198Dec 19, 2019Updated 6 years ago
- Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard☆1,787Feb 18, 2023Updated 3 years ago
- Text-Similarity Method in Pytorch☆469Dec 9, 2018Updated 7 years ago
- 常用文本匹配模型tf版本,数据集为QA_corpus,持续更新中☆674Oct 12, 2019Updated 6 years ago
- Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)☆10,177Jul 15, 2025Updated 7 months ago
- 中文近义词:聊天机器人,智能问答工具包☆5,104Feb 1, 2026Updated last month
- RoBERTa中文预训练模型: RoBERTa for Chinese☆2,773Jul 22, 2024Updated last year
- 中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。☆4,575Nov 21, 2023Updated 2 years ago
- 用TF特征向量和simhash指纹计算中文文本的相似度☆217Aug 12, 2016Updated 9 years ago
- self complement of Sentence Similarity compute based on cilin, hownet, simhash, wordvector,vsm models,基于同义词词林,知网,指纹,字词向量,向量空间模型的句子相似度计算。☆365Dec 15, 2018Updated 7 years ago
- An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。☆1,386May 31, 2022Updated 3 years ago
- 用于训练中英文对话系统的语料库 Datasets for Training Chatbot System☆2,051Sep 23, 2020Updated 5 years ago
- ChineseTextualInference project including chinese corpus build and inferecence model, 中文文本推断项目,包括88万文本蕴含中文文本蕴含数据集的翻译与构建,基于深度学习的文本蕴含判定模型构建…☆176Dec 15, 2018Updated 7 years ago
- 自然语言处理实验(sougou数据集),TF-IDF,文本分类、聚类、词向量、情感识别、关系抽取等☆1,728Jul 18, 2022Updated 3 years ago
- 中文短文句相似读☆137Jun 14, 2018Updated 7 years ago
- FAQ-based Question Answering System☆2,581Nov 28, 2020Updated 5 years ago
- A very simple BiLSTM-CRF model for Chinese Named Entity Recognition 中文命名实体识别 (TensorFlow)☆2,338Apr 18, 2022Updated 3 years ago
- 中文自然语言推理数据集(A large-scale Chinese Nature language inference and Semantic similarity calculation Dataset)☆435Feb 10, 2020Updated 6 years ago
- A curated list of resources for Chinese NLP 中文自然语言处理相关资料☆7,925Jul 27, 2023Updated 2 years ago
- 高质量中文预训练模型集合:最先进大模型、最快小模型、相似度专门模型☆816Jul 8, 2020Updated 5 years ago
- 新词发现算法(NewWordDetection)☆63Sep 4, 2017Updated 8 years ago
- ccks baidu entity link 实体链接 第一名☆843Dec 19, 2023Updated 2 years ago
- 中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard☆4,232Feb 6, 2026Updated 3 weeks ago
- SMP2017中文人机对话评测数据☆108Oct 19, 2017Updated 8 years ago
- This repo contains our ACL 2017 paper data and source code☆730Sep 15, 2020Updated 5 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo☆3,106May 9, 2024Updated last year
- Open Chinese Language Pre-trained Model Zoo☆984Mar 18, 2020Updated 5 years ago