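Implements Chinese text classification for both files and raw text, based on a multinomial naive Bayes classifier. Because the real-world use case at work was binary classification, and because building a map of word vectors for every class could cause memory problems, only binary classification is supported for now. Extending the same structure to multi-class classification would be straightforward. The main reason for writing it from scratch is that I had not studied the code of mahout, weka, and similar libraries closely enough to handle Chinese word segmentation, stop-word filtering, term-frequency counting, TF-IDF, and so on flexibly; in other words, for vectorization and feature extraction they were less flexible than hand-written code.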
☆22 · Sep 13, 2016 · Updated 9 years ago
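The description above can be illustrated with a minimal sketch of a two-class multinomial naive Bayes classifier. This is not the repository's code: the class name `BinaryMultinomialNB`, its methods, the toy documents, and the stop-word set are all hypothetical, and the input is assumed to be already segmented into tokens (the word segmenter itself is out of scope here). It uses raw term frequencies with add-one smoothing and keeps one term-frequency map per class, which is the structure the description refers to.

```python
import math
from collections import Counter


class BinaryMultinomialNB:
    """Hypothetical two-class multinomial naive Bayes with add-one smoothing."""

    def __init__(self, stop_words=()):
        self.stop_words = set(stop_words)
        # One term-frequency map per class (labels 0 and 1).
        self.word_counts = {0: Counter(), 1: Counter()}
        self.doc_counts = {0: 0, 1: 0}
        self.vocab = set()

    def _filter(self, tokens):
        # Stop-word filtering on pre-segmented tokens.
        return [t for t in tokens if t not in self.stop_words]

    def fit(self, docs, labels):
        # Assumes both classes appear at least once in the training data.
        for tokens, label in zip(docs, labels):
            kept = self._filter(tokens)
            self.word_counts[label].update(kept)
            self.doc_counts[label] += 1
            self.vocab.update(kept)
        return self

    def log_scores(self, tokens):
        total_docs = sum(self.doc_counts.values())
        vocab_size = len(self.vocab)
        # Tokens never seen in training are skipped.
        kept = [t for t in self._filter(tokens) if t in self.vocab]
        scores = {}
        for label in (0, 1):
            counts = self.word_counts[label]
            total_words = sum(counts.values())
            # Log prior plus smoothed log likelihood of each token.
            score = math.log(self.doc_counts[label] / total_docs)
            for t in kept:
                score += math.log((counts[t] + 1) / (total_words + vocab_size))
            scores[label] = score
        return scores

    def predict(self, tokens):
        scores = self.log_scores(tokens)
        return max(scores, key=scores.get)


# Toy, hand-segmented training data (illustrative only).
docs = [
    ["价格", "便宜", "质量", "好"],
    ["质量", "好", "推荐"],
    ["质量", "差", "退货"],
    ["太", "差", "不", "推荐"],
]
labels = [1, 1, 0, 0]
clf = BinaryMultinomialNB(stop_words={"太", "不"}).fit(docs, labels)
```

Calling `clf.predict(["质量", "好"])` selects the class with the higher log score. Extending the sketch to more classes would mean adding one more `Counter` per class, which is where the memory concern in the description comes from.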
Alternatives and similar repositories for ChineseTextClassifier
Users that are interested in ChineseTextClassifier are comparing it to the libraries listed below.
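- Research on SVM-based short text classification ☆19 · Sep 24, 2014 · Updated 11 years ago
- Research on Chinese semantic similarity computation based on artificial neural networks ☆11 · Apr 1, 2013 · Updated 13 years ago
- Sentiment analysis | text classification | entity recognition | semantic association | summary extraction ☆10 · May 25, 2017 · Updated 8 years ago
- Some improvements to and applications of word2vec. ☆13 · May 18, 2017 · Updated 8 years ago
- Architecture example from an older version of JD.com (京东) ☆10 · Aug 14, 2013 · Updated 12 years ago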
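- NGramSynonymTokenizer for Elasticsearch ☆24 · Dec 14, 2021 · Updated 4 years ago
- A collection of utility programs for everyday data preprocessing, with a focus on text processing. Includes word segmentation (integrating the Zhang Huaping (张华平) segmenter and jieba (结巴)), enhanced file handling (e.g., reading text into a Map and saving text to a Map), and corpus models (converting documents into matrices, counting words, etc.) ☆21 · Oct 3, 2024 · Updated last year
- A simple implementation of the TextRank algorithm for NLP keyword extraction ☆27 · May 2, 2017 · Updated 8 years ago
- Simple fully-connected highway networks using TensorFlow. ☆25 · Aug 21, 2017 · Updated 8 years ago
- ☆12 · Mar 21, 2024 · Updated 2 years ago
- A simple image retrieval system built using NodeJS. Work in progress. ☆10 · Nov 12, 2015 · Updated 10 years ago
- For icibm paper ☆11 · Feb 23, 2017 · Updated 9 years ago
- Android framework ☆15 · Dec 5, 2018 · Updated 7 years ago
- Chinese text sentiment classification based on a sentiment lexicon and the naive Bayes algorithm ☆84 · May 22, 2014 · Updated 11 years ago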
- PostGIS 2.0.5 for GreenPlum 4.3.x ☆12 · Oct 25, 2016 · Updated 9 years ago
- A mini blog with NoSql, Dubbo and Spring ☆11 · Nov 29, 2013 · Updated 12 years ago
- A word2vec implementation for the Chinese language, based on deeplearning4j and ansj ☆30 · Feb 19, 2021 · Updated 5 years ago
- JSONDB (deprecated) ☆36 · Jan 12, 2013 · Updated 13 years ago
- Unbounded cache model for online language modeling with open vocabulary ☆11 · Feb 15, 2019 · Updated 7 years ago
- Deep neural network inference transpiler tool for tflite and NNAPI ☆12 · Jul 16, 2018 · Updated 7 years ago
- Implementation of semantic question matching with deep learning approaches mentioned in the blog of Quora. ☆14 · Jun 1, 2017 · Updated 8 years ago
- ☆53 · Jan 23, 2017 · Updated 9 years ago
- The quantization of CNN/LSTM ☆11 · Mar 26, 2017 · Updated 9 years ago
- Text classifier based on the naive Bayes model ☆14 · Jun 24, 2016 · Updated 9 years ago
- Semantic similarity computation based on HowNet (《知网》), Python 2.7 API ☆13 · Jun 23, 2017 · Updated 8 years ago
- Parallel corpora for any languages supported by glosbe.com ☆10 · Feb 9, 2016 · Updated 10 years ago
- Hadoop training examples for aura.cn ☆17 · Jan 18, 2019 · Updated 7 years ago
- Discord Bot in python with rasa nlu, tensorflow, discord api ☆10 · Oct 15, 2018 · Updated 7 years ago
- A tool that allows you to search, delete, batch delete redis key, preview value of key, flush current db or flush or db. ☆13 · Jun 14, 2022 · Updated 3 years ago
- lqshanshuo's quantitative investment strategy ☆11 · Nov 12, 2020 · Updated 5 years ago
- opencart2.0 Chinese language pack: simplified registration, Alipay support, adapted to the Chinese market ☆10 · Nov 14, 2014 · Updated 11 years ago
- Example code - use word embeddings to make emoji prediction smarter with context ☆11 · Sep 14, 2018 · Updated 7 years ago
- ETL management platform based on Kettle ☆11 · Jan 3, 2019 · Updated 7 years ago
- Snowflake gRPC Server (Golang) and Client (PHP) ☆16 · Mar 19, 2026 · Updated 3 weeks ago
- An implementation of pinyin syllable segmentation (刘政怡, 吴建国 and 刘慧婷, 2008. 音节切分歧义方法研究. 计算机技术与发展, 18(8), pp.35-38.) ☆13 · Apr 8, 2019 · Updated 7 years ago
- WebSocket-based browser push server ☆11 · Oct 12, 2017 · Updated 8 years ago
- A fulltext search backend, especially for static websites having a sitemap. ☆17 · May 8, 2018 · Updated 7 years ago
- magicaldrag-2.2.9-release20200716 ☆12 · Jul 17, 2020 · Updated 5 years ago
- ☆10 · Sep 29, 2017 · Updated 8 years ago