hailinli / crawGovDataLinks
爬取政府网站的数据(赣州、吐鲁番、大理、太原、大庆)
☆33Updated 7 years ago
Alternatives and similar repositories for crawGovData
Users that are interested in crawGovData are comparing it to the libraries listed below
Sorting:
- An exploration for Eventline (important news Rank organized by pulic time),针对某一事件话题下的新闻报道集合,通过使用docrank算法,对新闻报道进行重要性识别,并通过新闻报道时间挑选出时间线上重要…☆226Updated 7 years ago
- Event monitor based on online news corpus including event storyline and analysis,基于给定事件关键词,采集事件资讯,对事件进行挖掘和分析。☆153Updated 7 years ago
- self complemented WeiboIndexSpyder based on Selenium ,新浪微博指数(微指数)采集,包括综合指数,移动端指数,PC端指数☆31Updated 7 years ago
- Tookit-Sihui, a tool of some common algorithm, AI文本混合科学计算器(calculator-sihui), 句子词频-逆文本频率(TF-IDF),搜索BM25, 前缀树搜索关键词(trietree), 模板匹配-递归函数(fu…☆24Updated 4 years ago
- self complemented BaiduIndexSpyder based on Selenium , index image decode and num image transfer,基于关键词的历时百度搜索指数自动采集☆42Updated 7 years ago
- 文本对关系比较 - 语义相似度、字面相似度、文本蕴含等等☆55Updated 6 years ago
- Self complemented text feature extraction using algorithms including CHI, DF, IG, MI for the experiment of text classification based on s…☆49Updated 7 years ago
- 从门户网站爬取新闻的摘要-标题对使用seq2seq根据摘要生成标题☆45Updated 8 years ago
- AC自动机python的实现,并进行了优化。 主要修复了 查询不准确的问题。☆77Updated 4 years ago
- 法研杯犯罪金额提取☆14Updated 3 years ago
- 无监督中文仿真评论自动生成。 Unsupervised Automatic Generation of Chinese Fake Reviews.☆84Updated 6 years ago
- 中文分词工具评估☆63Updated 3 years ago
- 公司、企业名称模糊匹配,基于词频的公司名主体提取,基于编辑距离的匹配度☆41Updated 5 years ago
- 中文语料:大量人工标注样本,非常有价值 !!!☆11Updated 6 years ago
- An Open-Source Package for Chinese Open-domain Conversational Chatbot (中文闲聊对话系统,一键部署微信闲聊机器人)☆108Updated 2 years ago
- 【梳理】FDDC2018金融算法挑战赛02-A股上市公司公告信息抽取☆94Updated 7 years ago
- 常用的中文停用词表☆78Updated 7 years ago
- 智能客服☆110Updated 6 years ago
- ☆37Updated 6 years ago
- Self complemented sentiment words expansion using seed sentiment words and so-pmi , this method is tested to be effective, 基于情感种子词与so-pmi…☆87Updated 7 years ago
- CCKS2019评测任务五-公众公司公告信息抽取,第3名☆122Updated 6 years ago
- 金庸小说人物关系图谱构建☆63Updated 6 years ago
- a beautiful method for cluster or community detection☆52Updated 6 years ago
- 互联网舆情企业风险事件的识别和预警,将公司名称进行实体提取,对新闻进行舆情分类,比赛地址为:http://ailab.aiwin.org.cn/competitions/48#learn_the_details☆19Updated 4 years ago
- 中国法研杯-司法人工智能挑战赛☆93Updated 7 years ago
- CCKS2019面向金融领域的事件主体抽取☆46Updated 6 years ago
- 利用文本挖掘技术进行新闻热点关注问题分析☆170Updated 7 years ago
- Quick run NLP in many task 快速运行分类、序列标注、匹配、生成等NLP任务的Tensorflow框架 (中文 NLP 支持分布式)☆31Updated 5 years ago
- 《机器阅读理解:算法与实践》代码☆157Updated last year
- 汉字字符特征提取工具,可以提取出字符中的字音(声母、韵母、声调)、字形(偏旁、部首)、四角编码等特征,同时可作为tensor输入到模型☆138Updated 5 years ago