JiangYanting / CNKI_Chinese_paper_datasetLinks
中国知网论文数据集,24000+篇论文信息。自然语言处理、信息管理、文本分类、文本摘要、关键词抽取、研究热点分析、数据挖掘、数据分析
☆53Updated 5 months ago
Alternatives and similar repositories for CNKI_Chinese_paper_dataset
Users that are interested in CNKI_Chinese_paper_dataset are comparing it to the libraries listed below
Sorting:
- 文本相似度,语义向量,文本向量,text-similarity,similarity, sentence-similarity,BERT,SimCSE,BERT-Whitening,Sentence-BERT, PromCSE, SBERT☆74Updated 9 months ago
- OpenTextClassification is all you need for text classification! Open text classification for everyone, enjoy your NLP journey! 这可能是目前为止最全…☆208Updated last year
- 各大文本摘要模型-中文文本可运行的解决方案☆69Updated last year
- Mimix: A Text Generation Tool and Pretrained Chinese Models☆157Updated 10 months ago
- pke_zh, python keyphrase extraction for chinese(zh). 中文关键词或关键句提取工具,实现了KeyBert、PositionRank、TopicRank、TextRank等算法,开箱即用。☆208Updated last year
- 中文文本相似度计算器☆158Updated 10 months ago
- 文本自动摘要☆93Updated 2 years ago
- 中文、分词、词表、核心词典、事件词表、停用词、敏感词、问答、问答数据、知识图谱、文本语料。☆168Updated 3 years ago
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。☆115Updated last year
- 文本相似度(匹配)计算,提供Baseline、训练、推理、指标分析...代码包含TensorFlow/Pytorch双版本☆179Updated 3 years ago
- 领域自适应文本挖掘工具(新词发现、情感分析、实体链接等),基于少量种子词和背景知识☆13Updated 6 years ago
- 超长文本分类(大于1000字);文档级/篇章级文本分类;主要是解决长距离依赖问题☆132Updated 3 years ago
- A Multi-Modal Dataset of Chinese Governmental Docunments☆35Updated 4 years ago
- Bert预训练模型fine-tune计算文本相似度☆107Updated 2 years ago
- Sentence-Transformers Information Retrieval example on Chinese☆30Updated last year
- 使用sentence-transformers(SBert)训练自己的文本相似度数据集并进行评估。☆49Updated 3 years ago
- 无监督中文关键词抽取(Keyphrase Extraction),基于统计,基于图【LDA与PageRank(TextRank, TPR, Salience Rank, Single TPR等)】,基于嵌入【SIFRank等】,开箱即用!☆108Updated 3 years ago
- SinglepassTextCluster, an TextCluster tools based on Singlepass cluster algorithm that use tfidf vector and doc2vec,which can be used for…☆64Updated 3 years ago
- RelExt: A Tool for Relation Extraction from Text. 文本实体关系抽取工具。☆50Updated 3 years ago
- 根据维基中文语料库预训练 GloVe 中文词向量;Pre-train GloVe word-embedding From Chinese Wiki corpus☆76Updated last year
- pytextclassifier is a toolkit for text classification. 文本分类,LR,Xgboost,TextCNN,FastText,TextRNN,BERT等分类模型实现,开箱即用。☆514Updated 11 months ago
- 中文新词发现算法PNW算法,可以识别任意长度的新词。☆16Updated 2 years ago
- 2020年第八届泰迪杯数据挖掘C题“智慧政务文本挖掘”特等奖作品(论文与代码)☆67Updated 4 years ago
- 文本分类baseline:BERT、半监督学习UDA、对抗学习、数据增强☆104Updated 4 years ago
- Law Crime Mining Based on Corpus build and content analysis by NLP methods. 基于领域语料库构建与NLP方法的裁判文书与犯罪案例文本挖掘项目☆345Updated 6 years ago
- 基于pytorch_bert的中文多标签分类☆91Updated 3 years ago
- 中文图书数据集/数据挖掘/自然语言处理/中国图书分类法/图书情报学/数据挖掘/文本分类/☆91Updated 5 months ago
- FAQ智能问答系统。实现FAQ的问题-模板匹配功能。部署轻量级的Web服务应用。☆63Updated last year
- ☆29Updated last year
- 中文文本情感分类数据集分享 chinese sentiment datasets☆82Updated 4 years ago