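A specialized (vertical) search engine for securities information, designed and implemented on top of web information mining techniques, with a focus on topic-specific crawling methods. It downloads web documents from the Internet, filters, segments, and converts them, and builds an index database that the retriever then queries using the keywords a user enters. The searcher also supports analyzing short, irregular content such as microblog posts and SMS messages. Data mining was carried out on 40-50 selected securities information websites, with a concept indexing method used for full-text web indexing. The program design takes search strategy and search engine data optimization into account; feature terms are ranked by influence weight, and the weight values can be output. The project is of theoretical and experimental value for the design and implementation of Chinese-language specialized search engines based on web mining.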
☆24 Dec 3, 2018 Updated 7 years ago
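The pipeline in the description (tokenize documents, build an index, answer keyword queries, and rank feature terms by weight) can be illustrated with a minimal sketch. This is not the repository's code: it swaps the concept indexing method for a plain inverted index with a TF-IDF-style weight, and the names `tokenize`, `build_index`, `term_weights`, and `search`, as well as the sample documents, are hypothetical. The tokenizer is a placeholder; Chinese text would need a proper word segmenter.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    # Placeholder tokenizer (assumption): lowercase runs of word characters.
    # Chinese text would need a real word segmenter at this step.
    return re.findall(r"\w+", text.lower())


def build_index(docs):
    """Build an inverted index: term -> {doc_id: term frequency}."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for term, tf in Counter(tokenize(text)).items():
            index[term][doc_id] = tf
    return index


def term_weights(index, n_docs):
    """Return (term, weight) pairs, highest weight first.

    The weight is total term frequency times a smoothed IDF. This is an
    illustrative choice, not the weighting used by the project.
    """
    weights = {}
    for term, postings in index.items():
        idf = math.log(n_docs / len(postings)) + 1.0
        weights[term] = sum(postings.values()) * idf
    # Ties are broken alphabetically so the order is deterministic.
    return sorted(weights.items(), key=lambda kv: (-kv[1], kv[0]))


def search(index, query):
    """Rank documents by the summed frequency of the query terms."""
    scores = Counter()
    for term in tokenize(query):
        for doc_id, tf in index.get(term, {}).items():
            scores[doc_id] += tf
    return [doc_id for doc_id, _ in scores.most_common()]


if __name__ == "__main__":
    # Hypothetical sample documents standing in for crawled pages.
    docs = {
        "a": "stock market rises",
        "b": "stock bond",
        "c": "bond yield falls",
    }
    index = build_index(docs)
    print(search(index, "stock market"))
    for term, weight in term_weights(index, len(docs)):
        print(f"{term}\t{weight:.3f}")
```

A production system would add the crawling and filtering stages in front of `build_index` and persist the index rather than keep it in memory.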
Alternatives and similar repositories for niusouyixia
Users that are interested in niusouyixia are comparing it to the libraries listed below
- A Django-based Weibo repost analysis system ☆14 Oct 26, 2018 Updated 7 years ago
- ZooKeeper configuration center with millisecond-level configuration updates ☆15 Dec 16, 2022 Updated 3 years ago
- CTP with python3.4.3 ☆10 Aug 1, 2017 Updated 8 years ago
- Jenkins Maven Repository Plugin ☆20 Oct 14, 2015 Updated 10 years ago
- Convert Weibo (Sina/Tencent/NetEase) data sources into an intermediate format supported by CiteSpace ☆10 Sep 27, 2011 Updated 14 years ago
- A simple but comprehensive way of analysing transitive node dependencies ☆10 Aug 5, 2016 Updated 9 years ago
- Tool for batch migration from SVN to Git ☆11 Aug 29, 2018 Updated 7 years ago
- Flymaple - Another Quadcopter in Open Source way. ☆19 Jan 28, 2014 Updated 12 years ago
- A thin wrapper around Mongo-java-driver that makes queries simpler ☆11 Oct 25, 2018 Updated 7 years ago
- Golang integration with payment platforms such as Baofoo (宝付), Allinpay (通联), and Fuiou Gold Account (富友金账户) ☆13 Mar 29, 2023 Updated 2 years ago
- Framework for a large supermarket operations system, built on GWT, EJB, and JPA; a complete Java EE web application ☆10 Jun 25, 2014 Updated 11 years ago
- To scrape Sina Weibo data you must first log in, but Sina has put some countermeasures in place; this is sample C# code I wrote that simulates a Sina Weibo login over HTTP. ☆11 Oct 22, 2014 Updated 11 years ago
- Elasticsearch as currently used in production ☆10 Apr 29, 2014 Updated 11 years ago
- Python library for interacting with jenkins ci ☆16 May 2, 2018 Updated 7 years ago
- A recipe application built with angular + mongoose + express ☆26 Feb 28, 2014 Updated 12 years ago
- ☆10 Mar 27, 2016 Updated 9 years ago
- Crawls Weibo data to build user profiles: logs in with an account to obtain cookies, uses selenium (first driving the Chrome browser, later switched to PhantomJS), and extracts the desired data from the page content ☆11 Mar 7, 2019 Updated 7 years ago
- Keywords enrichment by autocompletion (AWS, PM, RDC, CDS, ...), google suggestion scraping Heavy multithreaded semantic corpus crawler S… ☆12 May 22, 2015 Updated 10 years ago
- ☆11 Jun 29, 2017 Updated 8 years ago
- Python version of 挖掘鸡 ☆12 Nov 19, 2015 Updated 10 years ago
- Real-time data analytics platform ☆41 Jun 26, 2013 Updated 12 years ago
- A beginner's primer on the Go Programming Language ☆12 Sep 7, 2020 Updated 5 years ago
- Similar to GenerateAllSetter, but triggered via a postfix (i.e., like the .var usage)! ☆15 Aug 3, 2022 Updated 3 years ago
- Redis-based distributed lock, suited to distributed web development scenarios such as flash sales and auto-increment IDs ☆11 Mar 21, 2017 Updated 8 years ago
- kafa spring plugin ☆14 Aug 2, 2015 Updated 10 years ago
- ☆15 Jul 7, 2011 Updated 14 years ago
- Web Share API polyfill for Cordova ☆13 Oct 12, 2024 Updated last year
- Java Zip Utilities Tutorial ☆14 Sep 16, 2013 Updated 12 years ago
- This is a small demo project Using MongoDB and Spring Data ☆21 May 8, 2023 Updated 2 years ago
- Tool for discovering cross-origin postMessage() vulnerabilities in web pages. Basic principle: use AJAX to fetch the page code, build a test environment with an iframe and the data protocol, then insert a hook into window.onmessage inside the iframe to monitor the onmessage parameters, and finally, based on whether the original onme… ☆11 Sep 13, 2016 Updated 9 years ago
- Pandas Helper Library for reading and writing DataFrames from and to HBase. ☆10 Mar 8, 2018 Updated 8 years ago
- Docker - the open-source application container engine ☆11 May 6, 2015 Updated 10 years ago
- ☆17 Feb 26, 2026 Updated last week
- Sina Weibo interaction prediction ☆11 Jan 24, 2017 Updated 9 years ago
- A derived version from SMTH pyctp ☆11 Nov 3, 2016 Updated 9 years ago
- TypeScript/JavaScript library for generating DNSSEC proofs for the ENS DNSSEC oracle contract ☆14 Mar 28, 2023 Updated 2 years ago
- ElasticSearch-based distributed service for public-opinion retrieval and statistics. ☆11 Dec 13, 2023 Updated 2 years ago
- Tax rate conversion tool built with Bootstrap ☆12 Apr 9, 2017 Updated 8 years ago
- Several ways to implement distributed locks: a Redis-based distributed lock ☆12 Dec 5, 2016 Updated 9 years ago