实现中文文本分类,支持文件、文本分类,基于多项式分布的朴素贝叶斯分类器。由于工作实际应用是二分类,加之考虑到每个分类属性都建立map存储词语向量可能引起的内存问题,所以目前只支持二分类。当然,直接复用这个结构扩展到多分类也是很容易。之所以自己写,主要原因是没有仔细研读mahout、weka等代码,不能灵活地进行中文分词、停用词过滤、词频统计、TF-IDF等,也就是向量化和特征提取没有自己手写相对灵活。
☆23Sep 13, 2016Updated 9 years ago
Alternatives and similar repositories for ChineseTextClassifier
Users that are interested in ChineseTextClassifier are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于人工神经网络的中文语义相似度计算研究☆11Apr 1, 2013Updated 13 years ago
- 情感分析|文本分类|实体识别|语义联想|摘要提取☆10May 25, 2017Updated 9 years ago
- Elasticsearch zabbix 监控☆16Mar 9, 2017Updated 9 years ago
- NGramSynonymTokenizer for Elasticsearch☆24Dec 14, 2021Updated 4 years ago
- 【JavaSE】Java 知识汇总(资源,工具,笔记,源码,文章,文档分类整理);项目由Gradle版本工具构建;目前持续更新中...☆31May 11, 2018Updated 8 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 这是一个工具程序集合,方便我们平时对数据进行预处理。针对文本处理的内容较多。包括分词(集成了张华平分词、结巴分词)、文件处理增强(如读取文本到Map中,保存文本到Map)和语料模型(把文档转换成矩阵,就算单词数量等)☆21Oct 3, 2024Updated last year
- a simple implementation of textrank algorithm for nlp keywords extraction☆28May 2, 2017Updated 9 years ago
- Simple fully-connected highway networks using TensorFlow.☆25Aug 21, 2017Updated 8 years ago
- A simple Image retrieval system built using NodeJS. Work in progress.☆10Nov 12, 2015Updated 10 years ago
- 敏感信息,垃圾信息,黄赌毒信息判断☆11Jul 17, 2017Updated 8 years ago
- Android框架☆14Dec 5, 2018Updated 7 years ago
- 基于情感词典和朴素贝叶斯算法实现中文文本情感分类☆85May 22, 2014Updated 12 years ago
- PostGIS 2.0.5 for GreenPlum 4.3.x☆12Oct 25, 2016Updated 9 years ago
- maat是一个分布式事务中间件,实现了基于可靠消息的最终一致性事务控制.其可靠消息服务通过独立消息服务方案实现,与业务系统耦合低,目前支持的MQ有RocketMQ☆12Dec 6, 2018Updated 7 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Similarity is an optical as well as keyword based image similarity search engine built on top of Lire.☆31Aug 2, 2017Updated 8 years ago
- GeoJSON Jackson Serializers and Deserializers for PostGIS Geometry objects☆15May 29, 2024Updated 2 years ago
- Deep neural network inference transpiler tool for tflite and NNAPI☆12Jul 16, 2018Updated 7 years ago
- The quantization of CNN/LSTM☆11Mar 26, 2017Updated 9 years ago
- Quantopian lectures notebook translation☆24May 1, 2020Updated 6 years ago
- ☆13Sep 6, 2015Updated 10 years ago
- ☆11Apr 30, 2016Updated 10 years ago
- 基于《知网》的语义相似度计算 python2.7 API☆13Jun 23, 2017Updated 8 years ago
- 基于朴素贝叶斯模型的文本分类器☆14Jun 24, 2016Updated 9 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 南京本地程序员俱乐部 。重点解决程序员交友、程序员恋爱、程序员相亲、程序员找对象的问题,真正开源交友。我是细心姐姐。 微信搜索关注《大确幸》,有爱有行动。http://daquexing.cn☆11May 17, 2020Updated 6 years ago
- parallel corpora for any languages supported by glosbe.com☆11Feb 9, 2016Updated 10 years ago
- Discord Bot in python with rasa nlu, tensorflow, discord api☆10Oct 15, 2018Updated 7 years ago
- lqshanshuo的量化投资方案☆11Nov 12, 2020Updated 5 years ago
- opencart2.0 中文包 简化注册 支付宝 适应中国国情☆10Nov 14, 2014Updated 11 years ago
- ETL management platform based on Kettle☆11Jan 3, 2019Updated 7 years ago
- Snowflake gRPC Server(Golang) and Client(PHP)☆16Mar 19, 2026Updated 2 months ago
- 烂笔头应用的源码,下载地址 http://www.wandoujia.com/apps/com.ted.jots.myjot☆12Jan 3, 2017Updated 9 years ago
- A implementation of pinyin syllable segmentation (刘政怡, 吴建国 and 刘慧婷, 2008. 音节切分歧义方法研究. 计算机技术与发展, 18(8), pp.35-38.)☆13Apr 8, 2019Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 基于websocket的浏览器推送服务器☆11Oct 12, 2017Updated 8 years ago
- A distributed in-memory store for temporal knowledge graphs☆10Mar 20, 2024Updated 2 years ago
- magicaldrag-2.2.9-release20200716☆12Jul 17, 2020Updated 5 years ago
- ☆10Sep 29, 2017Updated 8 years ago
- Package ikitai is an optimizing just-in-time compiler for SSA-transformed Go.☆17Jun 6, 2020Updated 6 years ago
- identify the brand of a car based on one car image☆21Feb 1, 2013Updated 13 years ago
- KD Tree Implementation from Prof. Simon D. Levy (Washington & Lee University)☆24Oct 1, 2015Updated 10 years ago