pluto-junzeng/CNSD

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pluto-junzeng/CNSD)

pluto-junzeng / CNSD

中文自然语言推理数据集（A large-scale Chinese Nature language inference and Semantic similarity calculation Dataset）

☆434

Alternatives and similar repositories for CNSD

Users that are interested in CNSD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pluto-junzeng / ChineseSquad
View on GitHub
中文机器阅读理解数据集
☆108Mar 29, 2021Updated 5 years ago
zhoujx4 / NLP-Series-sentence-embeddings
View on GitHub
NLP句子编码、句子embedding、语义相似度：BERT_avg、BERT_whitening、SBERT、SmiCSE
☆178Dec 29, 2021Updated 4 years ago
vdogmcgee / SimCSE-Chinese-Pytorch
View on GitHub
SimCSE在中文上的复现，有监督+无监督
☆281Feb 21, 2025Updated last year
KwangKa / SIMCSE_unsup
View on GitHub
中文无监督SimCSE Pytorch实现
☆134Jul 8, 2021Updated 5 years ago
bojone / SimCSE
View on GitHub
SimCSE在中文任务上的简单实验
☆605Aug 7, 2023Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
CLUEbenchmark / CLUE
View on GitHub
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
☆4,271Feb 6, 2026Updated 5 months ago
CLUEbenchmark / OCNLI
View on GitHub
OCNLI: 中文原版自然语言推理任务
☆167Sep 23, 2021Updated 4 years ago
zejunwang1 / CSTS
View on GitHub
中文自然语言推理与语义相似度数据集
☆366Jan 5, 2022Updated 4 years ago
zhengyanzhao1997 / NLP-model
View on GitHub
☆278Apr 14, 2026Updated 3 months ago
xinyi-code / SimCSE-Pytorch
View on GitHub
中文数据集下SimCSE+ESimCSE的实现
☆190May 21, 2022Updated 4 years ago
princeton-nlp / SimCSE
View on GitHub
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
☆3,655Oct 16, 2024Updated last year
liucongg / NLPDataSet
View on GitHub
记录本人整理的一些数据集
☆1,092Jun 16, 2022Updated 4 years ago
ZhuiyiTechnology / pretrained-models
View on GitHub
Open Language Pre-trained Model Zoo
☆1,003Nov 18, 2021Updated 4 years ago
CLUEbenchmark / CLUEDatasetSearch
View on GitHub
搜索所有中文NLP数据集，附常用英文NLP数据集
☆4,459Nov 21, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
bojone / CoSENT
View on GitHub
比Sentence-BERT更有效的句向量方案
☆373Nov 9, 2022Updated 3 years ago
yangjianxin1 / SimCSE
View on GitHub
SimCSE有监督与无监督实验复现
☆151Feb 22, 2024Updated 2 years ago
IceFlameWorm / NLP_Datasets
View on GitHub
中文NLP数据集
☆159Jul 24, 2019Updated 6 years ago
dbiir / UER-py
View on GitHub
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
☆3,110May 9, 2024Updated 2 years ago
ymcui / Chinese-BERT-wwm
View on GitHub
Pre-Training with Whole Word Masking for Chinese BERT（中文BERT-wwm系列模型）
☆10,223Apr 19, 2026Updated 3 months ago
ZhuiyiTechnology / simbert
View on GitHub
a bert for retrieval and generation
☆860Feb 26, 2021Updated 5 years ago
brightmart / albert_zh
View on GitHub
A LITE BERT FOR SELF-SUPERVISED LEARNING OF LANGUAGE REPRESENTATIONS, 海量中文预训练ALBERT模型
☆3,979Nov 21, 2022Updated 3 years ago
brightmart / nlp_chinese_corpus
View on GitHub
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
☆9,904Feb 6, 2026Updated 5 months ago
taishan1994 / chinese_sentence_embeddings
View on GitHub
bert_avg，bert_whitening，sbert，consert，simcse，esimcse 中文句向量表示
☆15Apr 7, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
425776024 / nlpcda
View on GitHub
一键中文数据增强包； NLP数据增强、bert数据增强、EDA：pip install nlpcda
☆1,880Mar 18, 2025Updated last year
liuhuanyong / ChineseTextualInference
View on GitHub
ChineseTextualInference project including chinese corpus build and inferecence model, 中文文本推断项目,包括88万文本蕴含中文文本蕴含数据集的翻译与构建,基于深度学习的文本蕴含判定模型构建…
☆174Dec 15, 2018Updated 7 years ago
bojone / BERT-whitening
View on GitHub
简单的向量白化改善句向量质量
☆486Jun 17, 2021Updated 5 years ago
ChineseGLUE / ChineseGLUE
View on GitHub
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard
☆1,783Feb 18, 2023Updated 3 years ago
ymcui / Chinese-ELECTRA
View on GitHub
Pre-trained Chinese ELECTRA（中文ELECTRA预训练模型）
☆1,433Apr 19, 2026Updated 3 months ago
luhua-rain / MRC_Competition_Dureader
View on GitHub
机器阅读理解冠军/亚军代码及中文预训练MRC模型
☆743Nov 19, 2022Updated 3 years ago
shawroad / CoSENT_Pytorch
View on GitHub
CoSENT、STS、SentenceBERT
☆170Feb 11, 2025Updated last year
LianjiaTech / BELLE
View on GitHub
BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）
☆8,273Oct 16, 2024Updated last year
InsaneLife / ChineseNLPCorpus
View on GitHub
中文自然语言处理数据集，平时做做实验的材料。欢迎补充提交合并。
☆4,602Nov 21, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
thu-coai / CDial-GPT
View on GitHub
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
☆1,956Jun 12, 2023Updated 3 years ago
brightmart / roberta_zh
View on GitHub
RoBERTa中文预训练模型: RoBERTa for Chinese
☆2,793Jul 22, 2024Updated last year
thu-coai / CrossWOZ
View on GitHub
A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset
☆721Jun 17, 2024Updated 2 years ago
IAdmireu / ChineseSTS
View on GitHub
中文文本语义相似度（Chinese Semantic Text Similarity）语料库建设
☆478Mar 7, 2018Updated 8 years ago
loujie0822 / DeepIE
View on GitHub
DeepIE: Deep Learning for Information Extraction
☆1,937Dec 9, 2022Updated 3 years ago
airaria / TextBrewer
View on GitHub
A PyTorch-based knowledge distillation toolkit for natural language processing
☆1,705May 8, 2023Updated 3 years ago
zhaogaofeng611 / TextMatch
View on GitHub
基于Pytorch的，中文语义相似度匹配模型（ABCNN、Albert、Bert、BIMPM、DecomposableAttention、DistilBert、ESIM、RE2、Roberta、SiaGRU、XlNet）
☆797Mar 22, 2020Updated 6 years ago