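In recent years, online civic-engagement platforms such as WeChat, Weibo, the mayor's mailbox, and the Sunshine Hotline have become important channels for governments to understand public opinion, pool public wisdom, and build public consensus. As a result, the volume of text data on public sentiment keeps growing, which poses a major challenge for departments that have relied mainly on manual work to categorize messages and compile hot topics. At the same time, with the development of big data technology, building smart government systems based on natural language processing has become a new trend in social governance innovation, and it can substantially improve government management and administrative efficiency. This work targets resident complaint and suggestion text data in "smart government". Keywords are extracted from the text with a vector space model, and several machine learning classification models are tested; the linear support vector regression algorithm gives the comparatively best result, with an F1-Score of 0.86. In the preprocessing for hot-topic mining, cosine similarity is used to group texts on similar topics, which are then filtered, and in SPSS a factor-based…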
☆35 · Jun 28, 2020 · Updated 5 years ago
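The description above mentions a vector space model and cosine similarity for grouping texts on similar topics. The repository's own code is not shown here, so the following is only a minimal sketch under stated assumptions: TF-IDF weighting stands in for the vector space model, documents are already tokenized (Chinese text would first need a word segmenter), and the greedy grouping rule, the function names, and the `threshold` value are all hypothetical.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Build sparse TF-IDF vectors (term -> weight dicts) for tokenized docs."""
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        # Smoothed IDF so that terms found in every document keep a positive weight.
        vectors.append({
            term: (count / len(doc)) * (math.log((1 + n) / (1 + df[term])) + 1)
            for term, count in tf.items()
        })
    return vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse vectors; 0.0 if either is empty."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def group_similar(docs, threshold=0.5):
    """Greedy grouping (an assumption, not the repository's method):
    attach each document to the first group whose seed document is
    at least `threshold` similar, otherwise start a new group."""
    vectors = tfidf_vectors(docs)
    groups = []  # each group is a list of document indices
    for i, vec in enumerate(vectors):
        for group in groups:
            if cosine_similarity(vectors[group[0]], vec) >= threshold:
                group.append(i)
                break
        else:
            groups.append([i])
    return groups


# Hypothetical, pre-tokenized complaint texts used only for illustration.
docs = [
    ["road", "noise", "night", "construction"],
    ["construction", "noise", "night"],
    ["school", "bus", "fee"],
    ["night", "construction", "noise", "complaint"],
]
groups = group_similar(docs)
```

With these toy documents, the three noise complaints fall into one group and the school bus complaint forms its own. A real pipeline would likely tune the threshold and use a proper clustering step rather than this greedy pass.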
Alternatives and similar repositories for Text-Mining
Users that are interested in Text-Mining are comparing it to the libraries listed below.
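- To understand the characteristics of users who watch popular movies, this project scrapes review data for popular movies from the Maoyan website and analyzes it: rating statistics, word clouds, review counts and average ratings by city, gender analysis, and the relationship between review count and time. ☆12 · Nov 14, 2019 · Updated 6 years ago
- Code for Unsupervised Domain Adaptation of a Pretrained Cross-Lingual Language Model, IJCAI 2020 ☆12 · Nov 26, 2020 · Updated 5 years ago
- Uses JD.com reviews as the dataset and classifies them with common machine learning algorithms such as KNN, SVM, logistic regression, Bayes, and xgboost, and with deep learning models including CNN, RNN, a combined CNN and RNN, Bi-GRU, and bert. Text classification is built with the fastnlp framework. ☆31 · Jul 2, 2020 · Updated 5 years ago
- ☆14 · Mar 15, 2022 · Updated 4 years ago
- Grand-prize entry (paper and code) for Problem C, "Smart Government Text Mining", of the 8th Teddy Cup data mining competition, 2020 ☆68 · Sep 4, 2025 · Updated 7 months ago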
- Calculation of systemic risk indicators ☆10 · Apr 20, 2020 · Updated 5 years ago
- Topic analysis based on jieba word segmentation and an LDA model ☆19 · Apr 20, 2019 · Updated 6 years ago
- Image gender recognition demo and model ☆10 · Oct 28, 2020 · Updated 5 years ago
- CNN-based text classification of Sina News ☆11 · Jul 22, 2019 · Updated 6 years ago
- Statistical Arbitrage & Algorithmic Trading: time series analysis and the presence of cointegration in cryptocurrency price series. ☆11 · Jan 10, 2019 · Updated 7 years ago
- Dataset and Source code of paper 'Enhancing Keyphrase Extraction from Academic Articles with their Reference Information'. ☆18 · Dec 26, 2022 · Updated 3 years ago
- Chinese long-text summarization dataset / social science paper-abstract dataset ☆20 · Aug 17, 2023 · Updated 2 years ago
- Chinese text classification with BERT ☆41 · Apr 26, 2019 · Updated 6 years ago
- GBDT for regression ☆10 · Dec 16, 2018 · Updated 7 years ago
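- Intelligent discovery and tiered classification of data content for data security governance: sharing a general approach to unsupervised Chinese text classification ☆52 · Jul 6, 2021 · Updated 4 years ago
- Chinese text classification (currently binary classification) ☆47 · May 27, 2017 · Updated 8 years ago
- Collections of functions written and compiled by me during my degree. ☆11 · Aug 2, 2022 · Updated 3 years ago
- test HanLP vs LTP ☆11 · Mar 28, 2018 · Updated 8 years ago
- Intelligent discovery and tiered classification of data content for data security governance. Leaderboard A rank 7, Leaderboard B rank 10 ☆35 · Feb 28, 2021 · Updated 5 years ago
- This project uses Keras and ALBERT for multi-class text classification, with ALBERT fine-tuned. ☆17 · Jan 5, 2021 · Updated 5 years ago
- A basic multi-layer LSTM RNN model for binary or multi-class classification of Chinese and English text ☆49 · Nov 5, 2018 · Updated 7 years ago
- Chinese address parsing system that extracts province, city, and district/county names and IDs ☆14 · Oct 5, 2022 · Updated 3 years ago
- Code for the ACL conference paper "An Online Semantic-enhanced Dirichlet Model for Short Text Stream Clustering" ☆17 · Apr 22, 2021 · Updated 4 years ago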
- A BERT+BiLSTM sentiment analysis baseline ☆26 · May 9, 2020 · Updated 5 years ago
- Social network analysis (SNA) is the process of investigating social structures through the use of networks and graph theory.[1] It chara… ☆23 · Jul 9, 2019 · Updated 6 years ago
- Overall data governance architecture ☆10 · Nov 11, 2019 · Updated 6 years ago
- Tag-based big data analysis system for user behavior logs ☆20 · Feb 3, 2021 · Updated 5 years ago
- LCSTS, ROUGE, short text summarization, NLG, seq2seq ☆23 · Jul 25, 2017 · Updated 8 years ago
- Course paper for "Medical and Health Data Analysis and Mining 2021": a BERT-based news classification experiment on the 20NewsGroups dataset ☆10 · Jun 22, 2021 · Updated 4 years ago
- Research project on glyph-based Chinese character embedding. Preparing for EMNLP 2019 ☆11 · Mar 18, 2019 · Updated 7 years ago
- ☆12 · Jan 14, 2020 · Updated 6 years ago
- A tool for recording and replaying keyboard and mouse actions. ☆17 · Feb 4, 2021 · Updated 5 years ago
- A visual machine learning modeling and management platform modeled on Alibaba Cloud PAI ☆10 · Jan 4, 2023 · Updated 3 years ago
- NLPCC 2017 task3 article text summary ☆24 · Jul 25, 2017 · Updated 8 years ago
- A systematic CPU/GPU performance study of lightgbm and xgboost classifiers for different data shapes and hardware setups. ☆18 · Aug 12, 2020 · Updated 5 years ago
- A modified version of the Python interpreter that makes running Python code unpredictable and unstable. ☆15 · Sep 1, 2022 · Updated 3 years ago
- Program to convert a picture of the question into text using OCR and perform a Google search. Applicable for Loco and HQ Trivia. ☆16 · Oct 19, 2018 · Updated 7 years ago
- Study notes and code for the book 《Python金融大数据挖掘与分析全流程详解》 (a full walkthrough of financial big data mining and analysis with Python) ☆14 · Aug 4, 2020 · Updated 5 years ago
- Medical consultation Q&A, NER, relation extraction ☆14 · May 28, 2020 · Updated 5 years ago