baipengyan / Chinese-StopWordsLinks
中文常用的停用词(包含百度、哈工大、四川大学等词表)
☆32Updated 6 years ago
Alternatives and similar repositories for Chinese-StopWords
Users that are interested in Chinese-StopWords are comparing it to the libraries listed below
Sorting:
- 大连理工大学情感词汇本体库及其他相关操作☆141Updated 8 years ago
 - 适用于中文分词的经济金融词典☆87Updated 4 years ago
 - 人民日报语料处理工具集 | Tools for Corpus of People's Daily☆285Updated 2 years ago
 - 中文文本分析工具、语料、预训练模型相关资源汇总。☆143Updated last month
 - 使用SO_PMI互信息算法、词向量法快速构建不同领域(手机、汽车等)的专业情感词典☆93Updated 3 years ago
 - Chinese Sentiment Analysis 中文文本情感分析☆189Updated 2 years ago
 - 该仓库收集了常用的中文情感词典,仅供学习☆132Updated last year
 - Self complemented sentiment words expansion using seed sentiment words and so-pmi , this method is tested to be effective, 基于情感种子词与so-pmi…☆87Updated 7 years ago
 - 常用的中文停用词表☆79Updated 7 years ago
 - An exploration for Eventline (important news Rank organized by pulic time),针对某一事件话题下的新闻报道集合,通过使用docrank算法,对新闻报道进行重要性识别,并通过新闻报道时间挑选出时间线上重要…☆224Updated 7 years ago
 - Core Data of HowNet and OpenHowNet Python API☆627Updated 3 years ago
 - Code for Chinese LIWC Lexicon Expansion via Hierarchical Classification of Word Embeddings with Sememe Attention (AAAI18)☆158Updated 7 years ago
 - 维基百科中文语料整理☆301Updated 7 years ago
 - 中文微博语料库 情感二分类☆280Updated 5 years ago
 - ASAP: A Chinese Review Dataset Towards Aspect Category Sentiment Analysis and Rating Prediction☆339Updated 4 years ago
 - Chinese Embedding collection incling token ,postag ,pinyin,dependency,word embedding.中文自然语言处理向量合集,包括字向量,拼音向量,词向量,词性向量,依存关系向量.共5种类型的向量☆456Updated 6 years ago
 - SmoothNLP 金融文本数据集(公开) Public Financial Datasets for NLP Researches Only☆490Updated 6 years ago
 - Law Crime Mining Based on Corpus build and content analysis by NLP methods. 基于领域语料库构建与NLP方法的裁判文书与犯罪案例文本挖掘项目☆351Updated 6 years ago
 - 中文突发事件语料库(Chinese Emergency Corpus)-上海大学-语义智能实验室☆716Updated 6 years ago
 - 提供一款中文版生成式摘要服务☆349Updated 3 weeks ago
 - [COLING 2022] CSL: A Large-scale Chinese Scientific Literature Dataset 中文科学文献数据集☆647Updated 2 years ago
 - 中文文本摘要(text summarization)工具包, 抽取式中文文本摘要 Extractive text summary of Lead3、keyword、textrank、text teaser、word significance、LDA、LSI、NMF。(gra…☆420Updated last year
 - 基于哈工大同义词词林扩展版的单词相似度计算方法☆371Updated 2 years ago
 - ☆59Updated 4 years ago
 - 搜集、整理、发布 中文 自然语言处理 语料/数据集,与 有志之士 共同 促进 中文 自然语言处理 的 发展。☆98Updated 6 years ago
 - Sequential Event Experiment based on Travel note crawled from XieCheng,基于50W携程出行游记的采集与顺承事件图谱构建.☆187Updated 6 years ago
 - SMP 2020年微博情感分类评测任务 第六名解决方案☆69Updated 3 years ago
 - We released BERT-wwm, a Chinese pre-training model based on Whole Word Masking technology, and models closely related to this technology.…☆63Updated 2 years ago
 - Chinese Sentiment Classification Tool. 情感极性分类,基于知网、清华、BosonNLP情感词典,易扩展,基准方法,开箱即用。☆99Updated 2 years ago
 - 中文近义词表 Chinese Synonyms☆262Updated 7 years ago