基于scikit-learn实现对新浪新闻的文本分类,数据集为100w篇文档,总计10类,测试集与训练集1:1划分。分类算法采用SVM和Bayes,其中Bayes作为baseline。
☆110Dec 24, 2018Updated 7 years ago
Alternatives and similar repositories for TextClassification
Users that are interested in TextClassification are comparing it to the libraries listed below
Sorting:
- 使用LDA+SVM进行文本的分类☆22Jul 23, 2017Updated 8 years ago
- 基于CNN的新浪新闻文本分类☆11Jul 22, 2019Updated 6 years ago
- 利用支持向量机实现中文文本分类☆29May 28, 2018Updated 7 years ago
- 基于Keras的中文文本分类系统,支援多种模型架构和训练策略,实验数据为中文新闻分类文本cnews数据集。☆48Jun 6, 2025Updated 8 months ago
- 文本分类是指在给定分类体系下 , 根据文本的内容自动确定文本类别的过程。首先我们根据scrapy爬虫根据中国知网URL的规律,爬取70多万条2014年公开的发明专利,然后通过数据清洗筛选出了60多万条含标签数据。通过TF-IDF对60多万条本文进行词频提取,依照词频排序提取…☆108Mar 14, 2018Updated 7 years ago
- SVM中文文本分类☆13Mar 13, 2022Updated 3 years ago
- ☆35Apr 7, 2020Updated 5 years ago
- 基于Bert的文本情感分析模型(含semeval14数据集)☆14Jun 30, 2019Updated 6 years ago
- 使用scik-learn 实现k-means,KNN,SVM,贝叶斯,topic_extraction等算法,同时评估分类的准确率,召回率和F值。语料库是中文文本☆43Jul 23, 2017Updated 8 years ago
- 零基础入门NLP - 新闻文本分类 正式赛第一名方案☆236Sep 10, 2020Updated 5 years ago
- RAN: Recurrent Attention Networks for Long-text Modeling | Findings of ACL23☆23Aug 12, 2023Updated 2 years ago
- FinnewsHunter: Multi-agent financial intelligence platform powered by AgenticX. Real-time news analysis, sentiment fusion, and alpha fact…☆1,330Jan 13, 2026Updated last month
- Official Code for Merging Statistical Feature via Adaptive Gate for Improved Text Classification (AAAI2021)☆26Feb 5, 2022Updated 4 years ago
- Re-implementation of multi-source pointer network.☆28Mar 25, 2020Updated 5 years ago
- 文本分类基准测试☆25Mar 29, 2018Updated 7 years ago
- 汽车主题情感分析大赛冠军☆27Dec 10, 2018Updated 7 years ago
- 专门的股票分析项目☆29Dec 18, 2020Updated 5 years ago
- 股票交易数据处理的整个 业务流程 数据源--->数据采集--->数据归类--->数据储存--->数据分析--->数据可视化☆31Nov 23, 2016Updated 9 years ago
- 新闻文本分类,TextCNN模型,pytorch实现. gitee//gitee.com/worldcreator/News_Classification☆24May 28, 2021Updated 4 years ago
- 《剑指offer》第2版 Java 题解☆11Jan 8, 2019Updated 7 years ago
- 😄😐😠 情感分析(使用 emoji 可视化)☆10Sep 5, 2021Updated 4 years ago
- User springboot+neo4j+nlp to build knowledge graph☆11Sep 8, 2022Updated 3 years ago
- Predicting chicago crime data using network science☆10Aug 26, 2018Updated 7 years ago
- ☆10Dec 10, 2018Updated 7 years ago
- 悟空客户管理:客户关系管理是指企业为提高核心竞争力,利用相应的信息技术以及互联网技术协调企业与顾客间在销售、营销和服务上的交互,从而提升其管理方式,向客户提供创新式的个性化的客户交互和服务的过程。其最终目标是吸引新客户、保留老客户以及将已有客户转为忠实客户,增加市场。☆10Jun 21, 2022Updated 3 years ago
- ☆10May 11, 2017Updated 8 years ago
- 语音播报 功能集成百度语音合成包TTS☆10Jun 5, 2019Updated 6 years ago
- 本项目使用Keras实现Transformer模型来进行文本分类(中文、英文均支持)。☆12Mar 31, 2022Updated 3 years ago
- Temporal and Causal Relation extraction module for the Newsreader project.☆10Oct 26, 2015Updated 10 years ago
- 抓取国家统计局数据☆13May 4, 2016Updated 9 years ago
- Finetuning a codegen model with python instruction set using QLORA technique for better efficacy☆11Aug 31, 2023Updated 2 years ago
- Match tokenized words and phrases within the original, untokenized, often messy, text.☆19Apr 11, 2023Updated 2 years ago
- 股票高频数据(数据来源:新浪)☆13Jan 29, 2020Updated 6 years ago
- 毕业设计:互联网新闻热点抽取系统☆10May 21, 2022Updated 3 years ago
- 一个基于朴素贝叶斯的胜率预测与计算项目。☆10Jul 5, 2019Updated 6 years ago
- 中文转emoji☆11Dec 17, 2018Updated 7 years ago
- The Stochastic Oscillator has two lines, the %K and %D. The %D line is more important to produce better trading signals.☆10Apr 16, 2018Updated 7 years ago
- Code for "Contextualized Embeddings in Named-Entity Recognition", ECIR 2020☆13Jul 25, 2024Updated last year
- Improving Opinion-Target Extraction with Character-Level Word Embeddings☆11Jan 25, 2018Updated 8 years ago