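A specialized search engine for securities information. The project covers the design and implementation of a specialized search engine based on web information mining, with a focus on topic-specific crawling. It downloads web documents from the Internet, filters, tokenizes, and converts them, and builds an index database that the retriever queries with user-entered keywords. The searcher also supports analysis of short, irregular content such as microblog posts and SMS messages. 40 to 50 securities information websites were selected for data mining, and full-text web indexing uses a concept-indexing method. The program design takes search strategy and search-engine data optimization into account; feature terms are ranked by influence weight, and the weight values can be output. The work has theoretical and experimental value for the design and implementation of Chinese specialized search engines based on web mining.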
☆24 · Dec 3, 2018 · Updated 7 years ago
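The description mentions tokenizing documents, building an index, answering keyword queries, and ranking feature terms by weight. A minimal sketch of that flow might look like the following. It is not the repository's code: the function names, the sample documents, the regex tokenizer (a stand-in for a real Chinese word segmenter), and the smoothed TF-IDF weighting are all assumptions made for illustration, and the concept-indexing method named in the description is not reproduced here.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on non-word characters (hypothetical stand-in for a Chinese segmenter)."""
    return [t for t in re.split(r"\W+", text.lower()) if t]


def build_index(docs):
    """Build an inverted index mapping term -> {doc_id: TF-IDF weight}."""
    term_counts = {doc_id: Counter(tokenize(text)) for doc_id, text in docs.items()}
    doc_freq = Counter()
    for counts in term_counts.values():
        doc_freq.update(counts.keys())
    n_docs = len(docs)
    index = defaultdict(dict)
    for doc_id, counts in term_counts.items():
        total = sum(counts.values())
        for term, count in counts.items():
            # Smoothed IDF; the weighting scheme is an assumption, not taken from the project.
            idf = math.log((1 + n_docs) / (1 + doc_freq[term])) + 1.0
            index[term][doc_id] = (count / total) * idf
    return index


def top_terms(index, doc_id, k=3):
    """Return the k highest-weighted feature terms of one document, with their weights."""
    weighted = [(term, postings[doc_id]) for term, postings in index.items() if doc_id in postings]
    return sorted(weighted, key=lambda pair: (-pair[1], pair[0]))[:k]


def search(index, query):
    """Score documents by the summed weights of the query terms they contain."""
    scores = defaultdict(float)
    for term in tokenize(query):
        for doc_id, weight in index.get(term, {}).items():
            scores[doc_id] += weight
    return sorted(scores.items(), key=lambda pair: (-pair[1], pair[0]))


# Hypothetical sample corpus standing in for crawled securities pages.
docs = {
    "d1": "stock market rally lifts bank shares",
    "d2": "bank shares fall as bond market weakens",
    "d3": "weather forecast sunny weekend",
}
index = build_index(docs)
results = search(index, "bank shares")
```

Here `search` returns `(doc_id, score)` pairs sorted by descending score, and `top_terms` lists a document's feature terms together with their weights, which mirrors the "output the weight values" behavior the description refers to.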
Alternatives and similar repositories for niusouyixia
Users who are interested in niusouyixia are comparing it to the libraries listed below.
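- A Django-based Weibo repost analysis system ☆14 · Oct 26, 2018 · Updated 7 years ago
- Docker - the open-source application container engine ☆11 · May 6, 2015 · Updated 11 years ago
- Weibo SDK (currently supports Sina Weibo and Tencent Weibo) ☆19 · Aug 1, 2012 · Updated 13 years ago
- A distributed public-opinion retrieval and statistics service based on ElasticSearch. ☆11 · Dec 13, 2023 · Updated 2 years ago
- A showcase using an extended, database-maintained IKAnalyzer together with the distributed search service SolrCloud and SolrJ. ☆12 · Aug 28, 2014 · Updated 11 years ago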
- CTP with python3.4.3 ☆10 · Aug 1, 2017 · Updated 8 years ago
- alibaba druid aggregated monitor ☆10 · Sep 1, 2017 · Updated 8 years ago
- convert weibo (sina/tencent/netease) data source into an intermediate format supported by citespace ☆10 · Sep 27, 2011 · Updated 14 years ago
- Owl search engine: crawler, word segmentation, indexing, search ☆27 · Jul 23, 2015 · Updated 10 years ago
- A zookeeper-based configuration center with millisecond-level configuration updates ☆15 · Dec 16, 2022 · Updated 3 years ago
- A Python version of 挖掘鸡 ☆12 · Nov 19, 2015 · Updated 10 years ago
- ☆10 · Apr 4, 2018 · Updated 8 years ago
- Scrapes Weibo repost-relationship data (weibo repost) ☆10 · Nov 16, 2015 · Updated 10 years ago
- Sina Weibo interaction prediction ☆11 · Jan 24, 2017 · Updated 9 years ago
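- Deduplicates massive collections of articles (long texts) using Google's simhash algorithm for large-scale web page deduplication. ☆11 · Dec 8, 2022 · Updated 3 years ago
- Video retrieval based on SG2300X (search video content in natural language and locate the specific time span that matches the description) ☆13 · Feb 29, 2024 · Updated 2 years ago
- Crawls Weibo data to build user profiles: logs in to an account to obtain cookies using selenium, first driving the Chrome browser and later switching to PhantomJS, then extracts the desired data from the page content ☆11 · Mar 7, 2019 · Updated 7 years ago
- A mainline kernel with patches runs on YeeLoong 8089D / Loongson 2F. ☆19 · Jun 4, 2018 · Updated 7 years ago
- Flymaple - Another Quadcopter in Open Source way. ☆19 · Jan 28, 2014 · Updated 12 years ago
- Golang integrations for payment platforms such as 宝付 (Baofu), 通联 (Tonglian), and 富友金账户 (Fuyou Gold Account) ☆13 · Mar 29, 2023 · Updated 3 years ago
- Track the keyword positions ☆19 · Oct 26, 2013 · Updated 12 years ago
- The elasticsearch setup currently used in production ☆10 · Apr 29, 2014 · Updated 12 years ago
- Discuz Q. ☆16 · Dec 23, 2020 · Updated 5 years ago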
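- A further Qt wrapper around zmq's C++ interface, providing client and server implementations for a many-to-one server mode and a one-to-many push mode. ☆12 · Oct 7, 2013 · Updated 12 years ago
- Function: takes a newly entered text, compares its similarity against existing data, and returns the top 10 texts. Main methods: jieba Chinese word segmentation, gensim, TF-IDF term importance, and cosine similarity. ☆11 · Jul 30, 2020 · Updated 5 years ago
- ☆15 · Jul 7, 2011 · Updated 14 years ago
- ☆12 · May 3, 2024 · Updated 2 years ago
- ☆11 · Jun 29, 2017 · Updated 8 years ago
- Text feature extraction: segments text with jieba, computes feature values with the tf-idf algorithm, and provides a method for training the idf frequency file ☆21 · Feb 11, 2017 · Updated 9 years ago
- A Sina Weibo crawler that monitors the content posted by target accounts ☆10 · Apr 13, 2017 · Updated 9 years ago
- This project implements several common NLP algorithms: keyword extraction (keyword), named entity recognition (named entity), automatic summarization (abstract), text similarity comparison (text similarity), and others ☆16 · Jan 16, 2022 · Updated 4 years ago
- A library for storing encrypted data in Mongo ☆38 · Nov 10, 2021 · Updated 4 years ago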
- Performing Latent Semantic Analysis with Python on large datasets. ☆13 · Jun 21, 2022 · Updated 3 years ago
- Hot deployment of classes ☆11 · Jan 31, 2015 · Updated 11 years ago
- A crawler for trending Sina Weibo posts, plus word-cloud analysis. ☆19 · Mar 29, 2018 · Updated 8 years ago
- Jenkins Maven Repository Plugin ☆20 · Oct 14, 2015 · Updated 10 years ago
- Control Android, Linux and Windows ☆16 · Feb 26, 2016 · Updated 10 years ago
- A search engine website built on elasticsearch ☆14 · Nov 22, 2022 · Updated 3 years ago
- Download stock prices, from netease. ☆13 · Aug 24, 2016 · Updated 9 years ago