YueYongDev / stopwords
常用中文停用词表及对比
☆71Updated 6 years ago
Alternatives and similar repositories for stopwords:
Users that are interested in stopwords are comparing it to the libraries listed below
- 基于ltp的简单评论观点抽取模块☆116Updated 6 years ago
- 各大中文分词性能评测☆157Updated 6 years ago
- Word similarity computation based on Tongyici Cilin☆119Updated 7 years ago
- This is a corpus of Chinese abbreviation, including negative full forms.☆196Updated 3 years ago
- ChineseTextualInference project including chinese corpus build and inferecence model, 中文文本推断项目,包括88万文本蕴含中文文本蕴含数据集的翻译与构建,基于深度学习的文本蕴含判定模型构建…☆172Updated 6 years ago
- 常用的中文停用词表☆75Updated 7 years ago
- 中国法研杯-司法人工智能挑战赛☆91Updated 6 years ago
- 基于哈工大同义词词林扩展版的单词相似度计算方法☆364Updated last year
- E-Commerce Sentiment Dict☆129Updated 6 years ago
- Chinese Subjective Dectection based on subjective knowlegebase, 中文主观性计算。基于中文主观性知识库的句子主观性评定方法。☆57Updated last year
- Dataset from 'Character-based BiLSTM-CRF Incorporating POS and Dictionaries for Chinese Opinion Target Extraction'☆43Updated 6 years ago
- 教育行业新闻 自动文摘 语料库 自动摘要☆199Updated 6 years ago
- HyponymyExtraction and Graph based on KB Schema, Baike-kb and online text extract, 基于知识概念体系,百科知识库,以及在线搜索结构化方式的词语上下位抽取与可视化展示☆171Updated 6 years ago
- NER(命名实体识别)中文语料,一站式获取☆128Updated 5 years ago
- 新词发现算法(NewWordDetection)☆92Updated 4 years ago
- 转换搜狗拼音词库为txt文件☆50Updated 7 years ago
- 维基百科中文语料整理☆297Updated 7 years ago
- THU Chinese Keyphrase Extraction Toolkit☆125Updated 7 years ago
- 新词发现 基于词频、凝聚系数和左右邻接信息熵☆122Updated 5 years ago
- ChineseHumorSentiment, chinese humor sentiment mining including corpus build and mining nlp methods.中文文本幽默情绪计算项目,项目包括幽默文本语料库的构建,幽默计算模型,包括…☆122Updated 6 years ago
- 李傲龍的博客☆81Updated 9 months ago
- NLP NER datasets video/music/book bio☆88Updated 4 years ago
- 中文分词工具评估☆61Updated 2 years ago
- 夸夸语料,来自豆瓣互相表扬组数据☆75Updated 6 years ago
- chinese and english corpus process script, python, c++, java☆197Updated 6 years ago
- 法研杯2019相似案例匹配第二名解决方案(附数据集和文档),CAIL2020/2021司法考试赛道冠军队伍☆248Updated 3 years ago
- Word Similarity and Word Analogy Task scripts☆70Updated 6 years ago
- 使用BERT模型进行文本分类,相似句子判断,以及词性标注☆89Updated 6 years ago
- 无监督观点聚类。通过依存关系进行观点提取,对观点进行相似度计算,对已经生成的观点聚类☆47Updated 6 years ago
- Chinese Embedding collection incling token ,postag ,pinyin,dependency,word embedding.中文自然语言处理向量合集,包括字向量,拼音向量,词向量,词性向量,依存关系向量.共5种类型的向量☆455Updated 6 years ago