A commonly used Chinese stop word list
☆80Apr 2, 2018Updated 7 years ago
Alternatives and similar repositories for ChineseStopWords
Users that are interested in ChineseStopWords are comparing it to the libraries listed below.
- Title and keywords are used to generate text.☆12Dec 6, 2021Updated 4 years ago
- Python implementation of the Aho-Corasick automaton.☆19Jul 5, 2021Updated 4 years ago
- Language Collection☆14Dec 20, 2025Updated 3 months ago
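- Dictionary-based text sentiment analysis with a user interface, “小白”☆10Jan 2, 2016Updated 10 years ago
- Code for the AI100 competition (http://competition.ai100.com.cn/html/game_det.html?id = 24&tab = 1), mainly for text classification; it covers CHI (chi-square) feature word selection, TF-IDF weighting, and algorithms such as naive Bayes, decision trees, SVM, and XGBoost☆15Mar 27, 2019Updated 6 years ago
- A better Java version of jieba☆21Apr 19, 2018Updated 7 years ago
- Dataset and baseline for the CCL2019 “小牛杯” Chinese humor computation task☆24Aug 27, 2024Updated last year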
- in progress☆45Dec 9, 2015Updated 10 years ago
- English or Chinese GPT2Dialog model from GPT2-chitchat☆12Feb 23, 2020Updated 6 years ago
- ☆83May 7, 2020Updated 5 years ago
- Sample program for quickly building a search engine☆10Aug 10, 2016Updated 9 years ago
- A benchmark dataset for evaluating dialog system and natural language generation metrics.☆39Jun 13, 2022Updated 3 years ago
- Related to pseudo-original content (article rewriting)☆14Sep 4, 2019Updated 6 years ago
- ☆14May 30, 2019Updated 6 years ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Feb 26, 2026Updated 3 weeks ago
- ☆16May 23, 2020Updated 5 years ago
- ☆12May 19, 2021Updated 4 years ago
- PHP library for intelligent Chinese word segmentation, based on Baidu's LAC project☆10Jun 25, 2024Updated last year
- ☆11Nov 27, 2022Updated 3 years ago
- ☆11Mar 30, 2021Updated 4 years ago
- Learning sentiment-specific word representations from tweets☆15Nov 21, 2015Updated 10 years ago
- Sentiment, stop word, synonym, degree, negation, and sensitive word dictionaries☆154Oct 7, 2021Updated 4 years ago
- Keras implementation of 'Convolutional Neural Networks for Sentence Classification' (Y. Kim, EMNLP 2014).☆13Jan 20, 2017Updated 9 years ago
- Use requests to send HTTP raw sockets (To Test RFC Compliance)☆24Jun 22, 2024Updated last year
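- Hand-written implementation of Elasticsearch's inverted index and the BM25 algorithm☆48Jan 9, 2019Updated 7 years ago
- Automatic news summarization based on TextRank: title, sentence position, important entity, and cue word features are combined into an overall sentence weight, and the MMR algorithm balances the summary's topical relevance against its diversity.☆26May 13, 2022Updated 3 years ago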
- ☆12Jun 21, 2023Updated 2 years ago
- Source code for the Twitter Hybrid Sentiment Classifier used in Semeval 2014 competition. (Sentiment Analysis system)☆13May 20, 2014Updated 11 years ago
- Code for Findings of ACL 2021 paper "Addressing Inquiries about History: An Efficient and Practical Framework for Evaluating Open-domain …☆19Dec 16, 2022Updated 3 years ago
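- Chinese question answering system: uses NLP techniques for information extraction, text summarization, and similar tasks over search engines, Q&A communities, and other sources; supports general-knowledge, community, and some domain-specific question answering☆32Jun 21, 2022Updated 3 years ago
- Chinese text paraphrasing adapted from the LaserTagger model, used for Chinese text data augmentation.☆322Jan 3, 2024Updated 2 years ago
- Text generation: automatically generates marketing copy from product parameters and images☆12Sep 17, 2021Updated 4 years ago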
- An OpenFlow application for resilient multicast☆12Mar 30, 2017Updated 8 years ago
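- Correlation analysis between Chinese names and gender☆13May 16, 2016Updated 9 years ago
- A simple trie structure implemented in Python that supports adding, looking up, and deleting keywords, for keyword matching, stop word removal, and similar tasks on Chinese text.☆63Apr 29, 2020Updated 5 years ago
- Multi-task learning resources, papers, and code☆15Jan 22, 2019Updated 7 years ago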
- Toward Practical Entity Alignment Method Design: Insights from New Highly Heterogeneous Knowledge Graph Datasets☆17Feb 18, 2025Updated last year
- Twitter Sentiment System for SemEval 2016☆11Mar 4, 2016Updated 10 years ago
- Convert digital images into georeferenced rasters☆14Mar 7, 2026Updated 2 weeks ago