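Implements Chinese text classification for both files and raw text, based on a multinomial naive Bayes classifier. Because the real-world use at work was binary classification, and because building a map of word vectors for every class could cause memory problems, only binary classification is supported for now. Extending the same structure to multi-class classification would be straightforward. The main reason for writing it from scratch is that I had not studied the code of mahout, weka, and similar libraries closely enough to use them flexibly for Chinese word segmentation, stop-word filtering, term-frequency counting, TF-IDF, and so on. In other words, for vectorization and feature extraction, those libraries were less flexible than hand-written code.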
☆22 · Sep 13, 2016 · Updated 9 years ago
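The approach described above can be illustrated with a minimal sketch of a binary multinomial naive Bayes classifier that keeps one word-count map per class. This is not the repository's code: the class name `BinaryMultinomialNB`, the toy corpus, and the use of Laplace smoothing with `alpha=1.0` are assumptions made for illustration, and the documents are assumed to be already segmented into tokens with stop words removed.

```python
import math
from collections import Counter


class BinaryMultinomialNB:
    """Hypothetical sketch: multinomial naive Bayes for two classes (0 and 1)."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # Laplace smoothing constant (an assumption)
        self.word_counts = {0: Counter(), 1: Counter()}  # one count map per class
        self.doc_counts = {0: 0, 1: 0}
        self.vocab = set()

    def fit(self, docs, labels):
        # docs: lists of tokens, already segmented; labels: 0 or 1.
        for tokens, label in zip(docs, labels):
            self.word_counts[label].update(tokens)
            self.doc_counts[label] += 1
            self.vocab.update(tokens)
        return self

    def log_score(self, tokens, label):
        # log P(class) + sum of log P(word | class), with smoothing.
        # Assumes each class has at least one training document.
        total_docs = sum(self.doc_counts.values())
        score = math.log(self.doc_counts[label] / total_docs)
        total_words = sum(self.word_counts[label].values())
        denom = total_words + self.alpha * len(self.vocab)
        for token in tokens:
            if token in self.vocab:  # ignore words never seen in training
                count = self.word_counts[label][token]
                score += math.log((count + self.alpha) / denom)
        return score

    def predict(self, tokens):
        return max((0, 1), key=lambda label: self.log_score(tokens, label))


# Toy corpus: label 1 = spam-like, label 0 = work-related.
docs = [
    ["免费", "中奖", "点击"],
    ["免费", "优惠", "点击"],
    ["会议", "明天", "开始"],
    ["项目", "会议", "报告"],
]
labels = [1, 1, 0, 0]

clf = BinaryMultinomialNB().fit(docs, labels)
print(clf.predict(["免费", "点击"]))
print(clf.predict(["会议", "报告"]))
```

Keeping the counts in a dictionary keyed by class label is what makes the multi-class extension mentioned above simple in principle: more labels mean more count maps, at the cost of more memory.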
Alternatives and similar repositories for ChineseTextClassifier
Users that are interested in ChineseTextClassifier are comparing it to the libraries listed below.
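- Research on short-text classification based on SVM ☆19 · Sep 24, 2014 · Updated 11 years ago
- Research on Chinese semantic similarity computation based on artificial neural networks ☆11 · Apr 1, 2013 · Updated 13 years ago
- Algorithm tests, including common matrix algorithms and basic algorithm packages such as mahout, weka, and R. ☆12 · Apr 26, 2015 · Updated 11 years ago
- Sentiment analysis | text classification | entity recognition | semantic association | summary extraction ☆10 · May 25, 2017 · Updated 8 years ago
- Some improvements to and applications of word2vec. ☆13 · May 18, 2017 · Updated 8 years ago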
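- NGramSynonymTokenizer for Elasticsearch ☆24 · Dec 14, 2021 · Updated 4 years ago
- Load more content when scrolling to the bottom ☆11 · Mar 14, 2016 · Updated 10 years ago
- A collection of utility programs for everyday data preprocessing, mostly aimed at text processing. It includes word segmentation (integrating the Zhang Huaping segmenter and Jieba), enhanced file handling (such as reading text into a Map and saving text to a Map), and corpus models (converting documents to matrices, counting words, and so on) ☆21 · Oct 3, 2024 · Updated last year
- a simple implementation of textrank algorithm for nlp keywords extraction ☆27 · May 2, 2017 · Updated 9 years ago
- Simple fully-connected highway networks using TensorFlow. ☆25 · Aug 21, 2017 · Updated 8 years ago
- Detection of sensitive content, spam, and pornography-, gambling-, and drug-related content ☆11 · Jul 17, 2017 · Updated 8 years ago
- Android framework ☆15 · Dec 5, 2018 · Updated 7 years ago
- Chinese text sentiment classification based on a sentiment lexicon and the naive Bayes algorithm ☆84 · May 22, 2014 · Updated 11 years ago
- PostGIS 2.0.5 for GreenPlum 4.3.x ☆12 · Oct 25, 2016 · Updated 9 years ago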
- a word2vec impl of Chinese language, based on deeplearning4j and ansj ☆30 · Feb 19, 2021 · Updated 5 years ago
- ☆11 · Nov 4, 2022 · Updated 3 years ago
- JSONDB (deprecated) ☆36 · Jan 12, 2013 · Updated 13 years ago
- Unbounded cache model for online language modeling with open vocabulary ☆11 · Feb 15, 2019 · Updated 7 years ago
- GeoJSON Jackson Serializers and Deserializers for PostGIS Geometry objects ☆15 · May 29, 2024 · Updated last year
- Deep neural network inference transpiler tool for tflite and NNAPI ☆12 · Jul 16, 2018 · Updated 7 years ago
- Implementation of semantic question matching with deep learning approaches mentioned in the blog of Quora. ☆14 · Jun 1, 2017 · Updated 8 years ago
- ☆53 · Jan 23, 2017 · Updated 9 years ago
- Quantopian lectures notebook translation ☆24 · May 1, 2020 · Updated 6 years ago
- A text classifier based on the naive Bayes model ☆14 · Jun 24, 2016 · Updated 9 years ago
- 🎥🤖 302 AI Audio and Video Summary 🚀✨ ☆18 · Aug 25, 2025 · Updated 8 months ago
- parallel corpora for any languages supported by glosbe.com ☆11 · Feb 9, 2016 · Updated 10 years ago
- hadoop training examples for aura.cn ☆17 · Jan 18, 2019 · Updated 7 years ago
- Discord Bot in python with rasa nlu, tensorflow, discord api ☆10 · Oct 15, 2018 · Updated 7 years ago
- A tool that allows you to search, delete, and batch delete redis keys, preview the value of a key, and flush the current db or all dbs. ☆13 · Jun 14, 2022 · Updated 3 years ago
- opencart2.0 Chinese language pack, simplified registration, Alipay, adapted to the Chinese market ☆10 · Nov 14, 2014 · Updated 11 years ago
- Example code - use word embeddings to make emoji prediction smarter with context ☆11 · Sep 14, 2018 · Updated 7 years ago
- ETL management platform based on Kettle ☆11 · Jan 3, 2019 · Updated 7 years ago
- Source code of the 烂笔头 app; download at http://www.wandoujia.com/apps/com.ted.jots.myjot ☆13 · Jan 3, 2017 · Updated 9 years ago
- A WebSocket-based browser push server ☆11 · Oct 12, 2017 · Updated 8 years ago
- A fulltext search backend, especially for static website having sitemap. ☆17 · May 8, 2018 · Updated 7 years ago
- magicaldrag-2.2.9-release20200716 ☆12 · Jul 17, 2020 · Updated 5 years ago
- identify the brand of a car based on one car image ☆21 · Feb 1, 2013 · Updated 13 years ago
- A PHP implementation of snowflake ☆12 · Aug 29, 2016 · Updated 9 years ago
- Object recognition and tracking using OpenCV ☆10 · Nov 29, 2019 · Updated 6 years ago