Building a Chinese Semantic Text Similarity corpus
☆482 · Mar 7, 2018 · Updated 8 years ago
Alternatives and similar repositories for ChineseSTS
Users that are interested in ChineseSTS are comparing it to the libraries listed below.
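- Open Language Pre-trained Model Zoo ☆1,005 · Nov 18, 2021 · Updated 4 years ago
- Generate sentence vectors with BERT in one line of code; use BERT for text classification and text similarity computation ☆1,669 · Oct 14, 2019 · Updated 6 years ago
- A BERT for retrieval and generation ☆859 · Feb 26, 2021 · Updated 5 years ago
- Question sentence similarity: given two sentences written by users in a customer-service setting, use an algorithm to decide whether they express the same meaning. ☆370 · Nov 17, 2020 · Updated 5 years ago
- Text matching models including DSSM, ESIM, ABCNN, and BIMPM, using the official LCQMC dataset ☆471 · May 8, 2022 · Updated 3 years ago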
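- Natural language processing (NLP): the Xiaojiang bot (a retrieval-based chit-chat chatbot), BERT sentence vectors and similarity (Sentence Similarity), XLNet sentence vectors and similarity (text xlnet embedding), text classification, entity extraction (ner,b… ☆1,539 · Sep 23, 2021 · Updated 4 years ago
- A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, a large collection of pre-trained Chinese ALBERT models ☆3,983 · Nov 21, 2022 · Updated 3 years ago
- Large Scale Chinese Corpus for NLP ☆9,872 · Feb 6, 2026 · Updated last month
- Chinese sentence similarity computation based on siamese-lstm ☆129 · Jul 1, 2018 · Updated 7 years ago
- TensorFlow versions of common text matching models, using the QA_corpus dataset; continuously updated ☆674 · Oct 12, 2019 · Updated 6 years ago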
- Facilitating the design, comparison and sharing of deep text matching models. ☆3,855 · Aug 2, 2024 · Updated last year
- Text-Similarity Method in Pytorch ☆469 · Dec 9, 2018 · Updated 7 years ago
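- 100+ Chinese Word Vectors: more than a hundred sets of pre-trained Chinese word vectors ☆12,188 · Oct 30, 2023 · Updated 2 years ago
- Compute Chinese text similarity using TF feature vectors and simhash fingerprints ☆217 · Aug 12, 2016 · Updated 9 years ago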
- Pre-Training with Whole Word Masking for Chinese BERT (the Chinese BERT-wwm model series) ☆10,179 · Jul 15, 2025 · Updated 8 months ago
- Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus and leaderboard ☆1,786 · Feb 18, 2023 · Updated 3 years ago
- Own implementation of sentence similarity computation based on Cilin (the Tongyici Cilin synonym thesaurus), HowNet, simhash fingerprints, character and word vectors, and vector space (VSM) models. ☆365 · Dec 15, 2018 · Updated 7 years ago
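- Chinese synonyms: a toolkit for chatbots and intelligent question answering ☆5,104 · Feb 1, 2026 · Updated last month
- A BERT-based semantic matching model built on a pre-trained Chinese model, using the official LCQMC dataset ☆198 · Dec 19, 2019 · Updated 6 years ago
- Pre-trained Chinese RoBERTa models: RoBERTa for Chinese ☆2,773 · Jul 22, 2024 · Updated last year
- Chinese word semantic similarity algorithm based on HowNet ☆41 · Jun 6, 2013 · Updated 12 years ago
- New word discovery algorithm (NewWordDetection) ☆63 · Sep 4, 2017 · Updated 8 years ago
- An implementation of the EDA paper for Chinese corpora: an EDA data augmentation tool for Chinese text. NLP data augmentation. Paper reading notes. ☆1,385 · May 31, 2022 · Updated 3 years ago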
- Chinese natural language inference dataset (A large-scale Chinese Natural Language Inference and Semantic Similarity Calculation Dataset) ☆435 · Feb 10, 2020 · Updated 6 years ago
- Chinese NLP datasets, material for everyday experiments. Additions and merge submissions are welcome. ☆4,581 · Nov 21, 2023 · Updated 2 years ago
- A word similarity method combining the extended Tongyici Cilin thesaurus with HowNet, offering broader vocabulary coverage and more accurate results. ☆744 · Feb 16, 2022 · Updated 4 years ago
- A curated list of resources for Chinese NLP ☆7,928 · Jul 27, 2023 · Updated 2 years ago
- Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard ☆4,238 · Feb 6, 2026 · Updated last month
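- Customer-service bot: QA models used in a Chinese retrieval chatbot, based on tf-idf, LSA, CNN, Transformer, BERT, and others ☆444 · Apr 9, 2022 · Updated 3 years ago
- Chinese short-text sentence similarity ☆136 · Jun 14, 2018 · Updated 7 years ago
- ChineseTextualInference project, including Chinese corpus building and an inference model: translation and construction of a Chinese textual entailment dataset of 880,000 examples, and construction of a deep-learning-based textual entailment classification model… ☆176 · Dec 15, 2018 · Updated 7 years ago
- Simple Chinese error correction and disambiguation ☆289 · Aug 19, 2015 · Updated 10 years ago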
- Corpora for training Chinese and English dialogue systems (Datasets for Training Chatbot System) ☆2,051 · Sep 23, 2020 · Updated 5 years ago
- FAQ-based Question Answering System ☆2,581 · Nov 28, 2020 · Updated 5 years ago
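- Text similarity computation / text matching ☆309 · Feb 8, 2020 · Updated 6 years ago
- Simple vector whitening to improve sentence vector quality ☆487 · Jun 17, 2021 · Updated 4 years ago
- Simple experiments with SimCSE on Chinese tasks ☆605 · Aug 7, 2023 · Updated 2 years ago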
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo ☆3,104 · May 9, 2024 · Updated last year
- This repo contains our ACL 2017 paper data and source code ☆730 · Sep 15, 2020 · Updated 5 years ago