基于scikit-learn实现对新浪新闻的文本分类,数据集为100w篇文档,总计10类,测试集与训练集1:1划分。分类算法采用SVM和Bayes,其中Bayes作为baseline。
☆110Dec 24, 2018Updated 7 years ago
Alternatives and similar repositories for TextClassification
Users that are interested in TextClassification are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于CNN的新浪新闻文本分类☆11Jul 22, 2019Updated 6 years ago
- 基于SVM的中文文本分类; python☆13May 24, 2019Updated 7 years ago
- 使用LDA+SVM进行文本的分 类☆22Jul 23, 2017Updated 8 years ago
- 毕业论文代码 + 评论文本数据获取+数据清洗+文本数据向量化+将数据放进分类器(KNN+Naive Bayes+SVM)中训练+结果评估☆55May 17, 2022Updated 4 years ago
- 新闻分类(文本分类CNN-RNN)☆10Sep 2, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 利用支持向量机实现中文文本分类☆29May 28, 2018Updated 8 years ago
- 对文本进行分词,去除停用词,LDA建模,利用贝叶斯算法进行新闻分类☆17Mar 22, 2018Updated 8 years ago
- 基于Keras的中文文本分类系统,支援多种模型架构和训练策略,实验数据为中文新闻分类文本cnews数据集。☆51Jun 6, 2025Updated last year
- SVM中文文本分类☆13Mar 13, 2022Updated 4 years ago
- 文本分类基准测试☆25Mar 29, 2018Updated 8 years ago
- 朴素贝叶斯实现的文本分类(新闻分类)☆67Dec 29, 2015Updated 10 years ago
- 利用贝叶斯算法设计实现文档分类系统☆17Aug 21, 2016Updated 9 years ago
- 自实现朴素贝叶斯分类器,文本分类一百万条新闻☆41Nov 24, 2018Updated 7 years ago
- 文本分类是指在给定分类体系下 , 根据文本的内容自动确定文本类别的过程。首先我们根据scrapy爬虫根据中国知网URL的规律,爬取70多万条2014年公开的发明专利,然后通过数据清洗筛选出了60多万条含标签数据。通过TF-IDF对60多万条本文进行词频提取,依照词频排序提取…☆108Mar 14, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 给定训练新闻数据集,可以对输入的测试新闻进行自动分类识别☆19Jul 26, 2015Updated 10 years ago
- 使用scik-learn 实现k-means,KNN,SVM,贝叶斯,topic_extraction等算法,同时评估分类的准确率,召回率和F值。语料库是中文文本☆43Jul 23, 2017Updated 8 years ago
- ☆35Apr 7, 2020Updated 6 years ago
- 零基础入门NLP - 新闻文本分类 正式赛第一名方案☆234Sep 10, 2020Updated 5 years ago
- FinnewsHunter: Multi-agent financial intelligence platform powered by AgenticX. Real-time news analysis, sentiment fusion, and alpha fact…☆1,461Jan 13, 2026Updated 5 months ago
- Re-implementation of multi-source pointer network.☆27Mar 25, 2020Updated 6 years ago
- 新闻文本分类,TextCNN模型,pytorch实现. gitee//gitee.com/worldcreator/News_Classification☆24May 28, 2021Updated 5 years ago
- Tensorflow Implementation of cnn text classification☆12Aug 1, 2018Updated 7 years ago
- Official Code for Merging Statistical Feature via Adaptive Gate for Improved Text Classification (AAAI2021)☆26Feb 5, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 基于深度学习(tensorflow)的中文文本分类☆15Apr 3, 2019Updated 7 years ago
- "Cross-lingual Language Model Pretraining for Retrieval". (WWW 2021)☆10Jun 17, 2022Updated 4 years ago
- 中文文本分类实践,基于搜狗新闻语料库,采用传统机器学习方法以及预训练模型等方法☆198Dec 16, 2020Updated 5 years ago
- 基于Bert的文本情感分析模型(含semeval14数据集)☆15Jun 30, 2019Updated 7 years ago
- RankNet, LambdaRank, LambdaMART, GBrank☆14Nov 16, 2013Updated 12 years ago
- Nowcasting macroeconomic indicators using Google Trends☆10Jun 23, 2022Updated 4 years ago
- 基于CNN实现的文本分类应用☆26Nov 17, 2020Updated 5 years ago
- 使用 Keras 进行中文文本分类☆53Jan 6, 2018Updated 8 years ago
- 包含leleketang.com做文库十万余条作文信息,每条作文包含标题、作者、时间、地点、正文、评语、等级等信息。根据文本数据,从多个维度对数据进行分析,并用python中的pyecharts绘制图表。使用TF-IDF和Doc2Vec模型统计关键词☆13Oct 6, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆115Jul 9, 2018Updated 7 years ago
- 公众号☆10Jul 24, 2023Updated 2 years ago
- 基于django的微信基础模块☆10Dec 18, 2016Updated 9 years ago
- 中医骨科电子病历数据集☆13Mar 30, 2019Updated 7 years ago
- Predicting chicago crime data using network science☆10Aug 26, 2018Updated 7 years ago
- 中文文本分类,使用搜狗文本分类语料库☆123Jul 31, 2016Updated 9 years ago
- 天池-Datawhale 零基础入门NLP-新闻文本分类 最终榜Top10分享☆62Sep 27, 2020Updated 5 years ago