近年来,随着微信、微博、市长信箱、阳光热线等网络问政平台逐步成为政府了解民意、汇聚民智、凝聚民气的重要渠道,各类社情民意相关的文本数据量不断攀升,给以往主要依靠人工来进行留言划分和热点整理的相关部门的工作带来了极大挑战。同时,随着大数据技术的发展,建立基于自然语言处理技术的智慧政务系统已经是社会治理创新发展的新趋势,对提升政府的管理水平和施政效率具有极大的推动作用。 本文针对“智慧政务”中的居民投诉建议文本评论数据,基于向量空间模型算法提取了文本关键词并我们采用了多种机器学习分类模型进行测试,从最终得到线性支持向量回归算法相对较优的结果,F1-Score评价指标达0.86。 在挖掘热点问题的前期处理上,使用了余弦相似度计算整理出文本相似的同类主题并加以筛选,通过在SPSS中建立基于因子…
☆36Jun 28, 2020Updated 5 years ago
Alternatives and similar repositories for Text-Mining
Users that are interested in Text-Mining are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 第八届“泰迪杯”数据挖掘挑战赛的一点心得☆11Nov 26, 2020Updated 5 years ago
- 为了了解观看热门电影的用户都有哪些特征,爬取猫眼网站热门电影的评论数据进行分析:评分统计、词云、城市评论数量与平均评分、性别分析、评论数量与时间的关系。☆12Nov 14, 2019Updated 6 years ago
- ☆11Nov 27, 2018Updated 7 years ago
- 微博舆情分析系统☆10Jun 21, 2022Updated 3 years ago
- 抓取微博转发关系数据,weibo repost☆10Nov 16, 2015Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 一个基于朴素贝叶斯算法的新闻文本分类器☆13Jan 12, 2018Updated 8 years ago
- 2020年第八届泰迪杯数据挖掘C题“智慧政务文本挖掘”特等奖作品(论文与代码)☆67Sep 4, 2025Updated 9 months ago
- 系统性风险 指标计算☆10Apr 20, 2020Updated 6 years ago
- 本项目采用Keras和ALBERT实现文本多标签分类任务,其中对ALBERT进行微调。☆13Jan 5, 2021Updated 5 years ago
- 基于jieba分词和lda模型的主题分析☆19Apr 20, 2019Updated 7 years ago
- 基于CNN的新浪新闻文本分类☆11Jul 22, 2019Updated 6 years ago
- Statistical Arbitrage & Algorithmic Trading: time series analysis and the presence of cointegration in cryptocurrency price series.☆11Jan 10, 2019Updated 7 years ago
- Dataset and Source code of paper 'Enhancing Keyphrase Extraction from Academic Articles with their Reference Information'.☆18Dec 26, 2022Updated 3 years ago
- Bert中文文本分类☆41Apr 26, 2019Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- GBDT for regression☆10Dec 16, 2018Updated 7 years ago
- ☆19Sep 3, 2024Updated last year
- 面向数据安全治理的数据内容智能发现与分级分类——一种通用的应对中文无监督文本分类的解题思路分享☆52Jul 6, 2021Updated 4 years ago
- 中文文本分类(目前是二分类)☆46May 27, 2017Updated 9 years ago
- test HanLP vs LTP☆11Mar 28, 2018Updated 8 years ago
- 面向数据安全治理的数据内容智能发现与分级分类 A榜rank7 B榜rank10☆35Feb 28, 2021Updated 5 years ago
- 本项目采用Keras和ALBERT实现文本多分类任务,其中对ALBERT进行微调。☆17Jan 5, 2021Updated 5 years ago
- 一个基本的多层lstm rnn模型,能实现中英文文本的二分类或多分类☆49Nov 5, 2018Updated 7 years ago
- 一个BERT+BiLSTM的情感分析 BaseLine☆26May 9, 2020Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 基于标签的用户行为日志大数据分析系统☆21Feb 3, 2021Updated 5 years ago
- 键盘鼠标操作的录制、重放工具。☆17Feb 4, 2021Updated 5 years ago
- Samples of multi-class text classification with Differential Privacy Tensorflow 2.0☆11Feb 8, 2020Updated 6 years ago
- NLPCC 2017 task3 article text summary☆23Jul 25, 2017Updated 8 years ago
- Calculate the domain age with python script☆11Mar 27, 2018Updated 8 years ago
- 本项目综合运用d3、echarts来完成可视化工作,实现了对nba两场比赛的可视化数据分析,包括球员运动轨迹、个人数据、传球次数以及得分位置等多种可交互式图表。通过可视化方法,我们能够进一步深入分析球队的具体情况,便于制定更佳的战术。☆15Dec 19, 2022Updated 3 years ago
- Weight of Evidence,基于iv值最大思想求最优分箱☆15Oct 24, 2019Updated 6 years ago
- lda 主题模型 用于主题提取和文本分类☆26Jul 8, 2017Updated 8 years ago
- ☆11Nov 11, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Using three approaches to calculate Value at Risk and Conditional Value at Risk of a portfolio of assets.☆13Apr 24, 2020Updated 6 years ago
- 本地解析+存储的Epub电子书阅读器☆10Jul 11, 2023Updated 2 years ago
- this repo is mnbvc text quality classification using fastText☆16Oct 2, 2023Updated 2 years ago
- 1.收集影响我国34个省市房地产价格的相关因子进行因子分析,将因子命名为3类。2.使用K-means对我国房地产价格进行聚类。3.使用多元回归分析针对广东省房地产价格深入分析并预测。☆13Aug 21, 2020Updated 5 years ago
- 基于多通道卷积神经网络的汽车评论情感分析系统☆12Nov 20, 2023Updated 2 years ago
- 第十届大学生服务外包大赛--A01商品短文本分类。基于CNN、Bi-LSTM、Attention、Adversarial等方法实现商品短文本分类任务,并基于Flask开发Web版本的交互演示界面。☆29Apr 29, 2022Updated 4 years ago
- A beautiful and applicable PyQt5 user interface.☆14May 15, 2021Updated 5 years ago