A commonly used Chinese stop-word list
☆82Apr 2, 2018Updated 8 years ago
Alternatives and similar repositories for ChineseStopWords
Users that are interested in ChineseStopWords are comparing it to the libraries listed below.
- This is an introduction to Chinese word segmentation using Jieba.☆15May 31, 2018Updated 8 years ago
- Article tag extraction☆16Dec 17, 2018Updated 7 years ago
- Legal-domain dictionary☆17Aug 30, 2019Updated 6 years ago
- A Python implementation of the Aho-Corasick automaton.☆19Jul 5, 2021Updated 4 years ago
- Self-implemented text feature extraction using algorithms including CHI, DF, IG, and MI, for a text classification experiment based on s…☆49Apr 18, 2018Updated 8 years ago
- The way to code, the way to learn PyTorch☆12Aug 18, 2019Updated 6 years ago
- Code for the AI100 competition (http://competition.ai100.com.cn/html/game_det.html?id=24&tab=1), mainly for text classification; it covers CHI feature-word selection, TF-IDF weighting, naive Bayes, decision trees, SVM, XGBoost, and other algorithms☆15Mar 27, 2019Updated 7 years ago
- A better Java version of jieba☆21Apr 19, 2018Updated 8 years ago
- Dataset and baseline for the CCL2019 "Xiaoniu Cup" (小牛杯) Chinese humor computation task☆25Aug 27, 2024Updated last year
- An old pseudo-original (article spinning) class, kept here as a memento and nothing more.☆14Aug 8, 2017Updated 8 years ago
- English or Chinese GPT2Dialog model from GPT2-chitchat☆12Feb 23, 2020Updated 6 years ago
- Source code of the paper "Zhicheng He, Jie Liu*, Na Li, and Yalou Huang. Learning Network-to-Network Model for Content-rich Network Embed…☆12May 15, 2019Updated 7 years ago
- Sentiwordnet (English and Spanish linked) binding class to perform Sentiment Analysis and Opinion Mining☆15Sep 27, 2013Updated 12 years ago
- Deep nonparametric estimation of discrete conditional distributions via smoothed dyadic partitioning☆15Apr 19, 2017Updated 9 years ago
- A benchmark dataset for evaluating dialog system and natural language generation metrics.☆39Jun 13, 2022Updated 3 years ago
- Generating NEW Reuters articles from Reuters articles.☆16Jan 10, 2017Updated 9 years ago
- Pseudo-original (article spinning) related code☆14Sep 4, 2019Updated 6 years ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆23Feb 26, 2026Updated 3 months ago
- A 5-million-entry Baidu Baike dataset☆50Dec 1, 2023Updated 2 years ago
- Repository for Unsupervised Sentence Compression using Denoising Auto-Encoders☆47Jul 25, 2024Updated last year
- Event extraction☆10Dec 15, 2016Updated 9 years ago
- A PHP library for intelligent Chinese word segmentation, based on Baidu's LAC project☆10Jun 25, 2024Updated last year
- ☆11Nov 27, 2022Updated 3 years ago
- ☆13Aug 28, 2018Updated 7 years ago
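- Keras implementation of 'Convolutional Neural Networks for Sentence Classification' (Y. Kim, EMNLP 2014).☆13Jan 20, 2017Updated 9 years ago
- Automatic news text summarization built on TextRank: it combines title features, sentence position features, important entity features, and cue word features into an overall sentence weight, and uses the MMR algorithm to balance topical relevance and diversity in the summary.☆26May 13, 2022Updated 4 years ago
- A hand-written implementation of Elasticsearch's inverted index and the BM25 algorithm☆48Jan 9, 2019Updated 7 years ago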
- Code for Findings of ACL 2021 paper "Addressing Inquiries about History: An Efficient and Practical Framework for Evaluating Open-domain …☆19Dec 16, 2022Updated 3 years ago
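- Chinese question answering system: uses NLP techniques for information extraction, text summarization, and similar tasks over search engines and Q&A communities; supports general-knowledge QA, community QA, and some specialized-domain QA☆32Jun 21, 2022Updated 3 years ago
- Text generation: automatically generates marketing copy from product attributes and images☆12Sep 17, 2021Updated 4 years ago
- Chinese text rewriting, adapted from the LaserTagger model. Text paraphrasing: Chinese text data augmentation based on LaserTagger.☆321Jan 3, 2024Updated 2 years ago
- An enhanced goose3 general-purpose web page extractor (add the author on WeChat: 862187570, for Python discussion and learning)☆16Nov 18, 2021Updated 4 years ago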
- Correlation analysis of Chinese names and gender☆13May 16, 2016Updated 10 years ago
- ☆16Jun 18, 2022Updated 3 years ago
- Toward Practical Entity Alignment Method Design: Insights from New Highly Heterogeneous Knowledge Graph Datasets☆17Feb 18, 2025Updated last year
- Convert digital images into georeferenced rasters☆14Jun 3, 2026Updated last week
- A demo of a user plugin for Elasticsearch 6.x☆12Mar 19, 2018Updated 8 years ago
- ☆13Jul 15, 2021Updated 4 years ago
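- Extraction of web page body text and body images, based on the Harbin Institute of Technology algorithm from "General Web Page Body Text Extraction Based on a Line-Block Distribution Function" (《基于行块分布函数的通用网页正文抽取》)☆11Jan 22, 2016Updated 10 years ago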