实现中文文本分类,支持文件、文本分类,基于多项式分布的朴素贝叶斯分类器。由于工作实际应用是二分类,加之考虑到每个分类属性都建立map存储词语向量可能引起的内存问题,所以目前只支持二分类。当然,直接复用这个结构扩展到多分类也是很容易。之所以自己写,主要原因是没有仔细研读mahout、weka等代码,不能灵活地进行中文分词、停用词过滤、词频统计、TF-IDF等,也就是向量化和特征提取没有自己手写相对灵活。
☆22Sep 13, 2016Updated 9 years ago
Alternatives and similar repositories for ChineseTextClassifier
Users that are interested in ChineseTextClassifier are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于SVM的短文本分类研究☆19Sep 24, 2014Updated 11 years ago
- 情感分析|文本分类|实体识别|语义联想|摘要提取☆10May 25, 2017Updated 8 years ago
- 京东老版本的架构示例☆10Aug 14, 2013Updated 12 years ago
- An Android app that listens to conversations and determines who was speaking at any point in the conversation - a task known as speech di…☆14Apr 12, 2021Updated 4 years ago
- 滚动到底部时加载更多内容☆11Mar 14, 2016Updated 10 years ago
- 【JavaSE】Java 知识汇总(资源,工具,笔记,源码,文章,文档分类整理);项目由Gradle版本工具构建;目前持续更新中...☆31May 11, 2018Updated 7 years ago
- Simple fully-connected highway networks using TensorFlow.☆25Aug 21, 2017Updated 8 years ago
- ☆12Mar 21, 2024Updated 2 years ago
- For icibm paper☆11Feb 23, 2017Updated 9 years ago
- 敏感信息,垃圾信息,黄赌毒信息判断☆11Jul 17, 2017Updated 8 years ago
- Android框架☆15Dec 5, 2018Updated 7 years ago
- 基于情感词典和朴素贝叶斯算法实现中文文本情感分类☆83May 22, 2014Updated 11 years ago
- a word2vec impl of Chinese language, based on deeplearning4j and ansj☆29Feb 19, 2021Updated 5 years ago
- maat是一个分布式事务中间件,实现了基于可靠消息的最终一致性事务控制.其可靠消息服务通过独立消息服务方案实现,与业务系统耦合低,目前支持的MQ有RocketMQ☆12Dec 6, 2018Updated 7 years ago
- Image Similarity Search for Maps☆18Dec 1, 2015Updated 10 years ago
- Unbounded cache model for online language modeling with open vocabulary☆11Feb 15, 2019Updated 7 years ago
- GeoJSON Jackson Serializers and Deserializers for PostGIS Geometry objects☆15May 29, 2024Updated last year
- Deep neural network inference transpiler tool for tflite and NNAPI☆12Jul 16, 2018Updated 7 years ago
- Implementation of semantic question matching with deep learning approaches mentioned in the blog of Quora.☆14Jun 1, 2017Updated 8 years ago
- ☆53Jan 23, 2017Updated 9 years ago
- The quantization of CNN/LSTM☆11Mar 26, 2017Updated 8 years ago
- Quantopian lectures notebook translation☆23May 1, 2020Updated 5 years ago
- ☆11Apr 30, 2016Updated 9 years ago
- 基于朴素贝叶斯模型的文本分类器☆14Jun 24, 2016Updated 9 years ago
- 基于《知网》的语义相似度计算 python2.7 API☆13Jun 23, 2017Updated 8 years ago
- parallel corpora for any languages supported by glosbe.com☆10Feb 9, 2016Updated 10 years ago
- hadoop training examples for aura.cn☆17Jan 18, 2019Updated 7 years ago
- 🎥🤖 302 AI Audio and Video Summary 🚀✨☆17Aug 25, 2025Updated 6 months ago
- opencart2.0 中文包 简化注册 支付宝 适应中国国情☆10Nov 14, 2014Updated 11 years ago
- ETL management platform based on Kettle☆11Jan 3, 2019Updated 7 years ago
- Example code - use word embeddings to make emoji prediction smarter with context☆11Sep 14, 2018Updated 7 years ago
- 电影评估推荐系统☆17Jul 31, 2016Updated 9 years ago
- A implementation of pinyin syllable segmentation (刘政怡, 吴建国 and 刘慧婷, 2008. 音节切分歧义方法研究. 计算机技术与发展, 18(8), pp.35-38.)☆13Apr 8, 2019Updated 6 years ago
- 基于websocket的浏览器推送服务器☆11Oct 12, 2017Updated 8 years ago
- vite技术揭秘、还原与实战☆10Mar 6, 2024Updated 2 years ago
- ☆10Sep 29, 2017Updated 8 years ago
- Package ikitai is an optimizing just-in-time compiler for SSA-transformed Go.☆17Jun 6, 2020Updated 5 years ago
- identify the brand of a car based on one car image☆21Feb 1, 2013Updated 13 years ago
- This project contains the necessary files to reproduce the paper: "Explaining Character-Aware Neural Networks for Word-Level Prediction: …☆12Nov 15, 2018Updated 7 years ago