Google1234/Information_retrieva_Projectl-

Google1234 / Information_retrieva_Projectl-

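News retrieval: a crawler performs targeted collection from 3-4 websites and implements extraction, retrieval, and indexing of web page information. At least 10 pages are collected; results can be sorted by attributes such as time, relevance, and popularity, and similar topics are clustered automatically. Supported features: related-search suggestions, snippet generation, and result preview (hovering the mouse over a result shows a preview).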
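The sorting and snippet features named in the description can be illustrated with a minimal Python sketch. This is not the repository's code: the `NewsResult` type, its fields, and the `sort_results` and `make_snippet` helpers are hypothetical names chosen for illustration, and the relevance score and view count are assumed to be computed elsewhere.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class NewsResult:
    """Hypothetical search hit; every field here is an assumption."""
    title: str
    published: date
    relevance: float  # assumed query-document score from the index
    views: int        # assumed popularity signal


def sort_results(results, by="relevance"):
    """Return results ordered by the chosen attribute, highest first."""
    keys = {
        "time": lambda r: r.published,
        "relevance": lambda r: r.relevance,
        "popularity": lambda r: r.views,
    }
    if by not in keys:
        raise ValueError(f"unknown sort attribute: {by}")
    return sorted(results, key=keys[by], reverse=True)


def make_snippet(text, query, width=20):
    """Cut a window of `width` characters around the first query match."""
    pos = text.find(query)
    if pos == -1:
        # No match: fall back to the start of the text.
        return text[: 2 * width]
    start = max(0, pos - width)
    end = min(len(text), pos + len(query) + width)
    return text[start:end]
```

A real system would likely replace the plain substring match in `make_snippet` with tokenized matching, since Chinese news text needs word segmentation before indexing.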

☆129

Alternatives and similar repositories for Information_retrieva_Projectl-

Users that are interested in Information_retrieva_Projectl- are comparing it to the libraries listed below.

mokizzz / SduViewWebSpider
[Information retrieval course project] Full-site crawl of the SDU news website, index construction, and search engine
☆58 · May 21, 2024 · Updated 2 years ago
JloveU / NewsRecommenderSystems
Final project for the first-year graduate fall-semester course "Web Data Mining": a news recommender system
☆14 · Dec 27, 2015 · Updated 10 years ago
feixuelove1009 / mysite-hotpoint
A news aggregation and sharing site similar to Chouti New Hot List (抽屉新热榜)
☆14 · Jan 3, 2017 · Updated 9 years ago
zyc1gq / DBSCAN_NEWS
Crawls news from websites, DBSCAN clustering, recommender system...
☆15 · May 22, 2018 · Updated 8 years ago
Little-girl-1992 / RAE
A recursive autoencoder neural network built with TensorFlow, used for sentence clustering
☆12 · Jul 7, 2017 · Updated 9 years ago
Yueqing-Sun / QA
Information retrieval lab: design and implementation of a question answering system
☆58 · Aug 7, 2019 · Updated 6 years ago
foriyte / NLPTK
Deep learning-based text classification and clustering tools
☆14 · Jul 7, 2017 · Updated 9 years ago
FudanYuan / faultLocalization
A project on fault localization in time series data
☆12 · Apr 18, 2019 · Updated 7 years ago
sph116 / zhongxin_search
Crawler for China News Service (中国新闻网): a full-site incremental crawler, usable as of July 2019
☆17 · Jul 13, 2019 · Updated 7 years ago
YLonely / web-data-mining
News recommendation for the Web Data Mining course at UCAS (University of Chinese Academy of Sciences)
☆17 · Feb 15, 2019 · Updated 7 years ago
FrankXiong / cqunews-web
A news website built from data that a Java web crawler collected from the Chongqing University news site and then parsed
☆11 · Mar 7, 2016 · Updated 10 years ago
yinzishao / NewsScrapy
A Scrapy-based news crawler
☆101 · Apr 18, 2020 · Updated 6 years ago
howie6879 / getNews
Internet news recommender system (myNews): an entry in the enterprise-proposed topic track of the 2016 National Computer Design Competition
☆45 · Apr 2, 2017 · Updated 9 years ago
vectorsss / news_classification
Automatic crawling and classification of NetEase News using a convolutional neural network and a crawler
☆13 · Dec 8, 2022 · Updated 3 years ago
dayicklp / Graduation-Project
Graduation project: a system for extracting trending topics from internet news
☆10 · May 21, 2022 · Updated 4 years ago
moluchase / NLPdemo
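A collection of small NLP demos, including text generation, text classification, text clustering, and more, implemented with TensorFlow. Updated on an ongoing basis; corrections and discussion are welcome.
☆13 · May 7, 2018 · Updated 8 years ago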
crystal-tensor / spide
A web crawler that mainly collects stock data, foreign exchange data, stock background information, and up-to-date stock news
☆13 · Aug 13, 2018 · Updated 7 years ago
goozp / ths-spider-example
A complete Scrapy crawler example that crawls stock and news data
☆17 · Aug 15, 2020 · Updated 5 years ago
ucasyp / ucasir
A Lucene-based news search engine [project assignment for the Chinese Academy of Sciences course Modern Information Retrieval]
☆16 · Jul 17, 2016 · Updated 10 years ago
luguoyuanf / StockAnalytics
Fetches stock data daily and saves it to MongoDB.
☆18 · Nov 2, 2015 · Updated 10 years ago
duoan / codes-scratch-crawler
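Reading notes on the book 《自己动手写网络爬虫》 (Write Your Own Web Crawler), with code typed out by hand. Mainly covers basic web crawler implementation, web page deduplication algorithms, web page fingerprinting algorithms, and text mining.
☆47 · Jan 9, 2015 · Updated 11 years ago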
lzjqsdd / NewsSpider
Crawls news from Toutiao, NetEase, Tencent, and others, and builds a simple search engine
☆637 · May 14, 2024 · Updated 2 years ago
duyongan / text_process
Summarization, keywords, key phrases, text similarity, word and sentence segmentation (a natural language processing toolkit)
☆11 · Aug 16, 2019 · Updated 6 years ago
luzy99 / news-spider
A keyword-driven news crawler for specified sites
☆17 · Sep 19, 2020 · Updated 5 years ago
ienergetic / 2018-7-2
A personalized news recommender system based on user behavior (keywords and previously viewed news)
☆42 · Jul 2, 2018 · Updated 8 years ago
taozhijiang / readmeinfo
A news feed reader that includes a recommendation scheme
☆28 · Jul 22, 2016 · Updated 10 years ago
rio-2607 / baidu_spider
A simple crawler written with BeautifulSoup that scrapes Baidu search results
☆20 · Jul 29, 2015 · Updated 10 years ago
kongliang2015 / YahooNews_Classification
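Uses a Python crawler to fetch news from Yahoo Japan (politics, economy, sports, and other categories), computes similarity between news texts, and trains a news classification model
☆19 · Nov 14, 2017 · Updated 8 years ago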
mJackie / LTY-Search
A news search engine based on Lucene and Servlet
☆21 · Feb 23, 2018 · Updated 8 years ago
Family-TreeSY / SpiderList
Spider Collection
☆23 · Aug 28, 2018 · Updated 7 years ago
LLLzasd / python_spider
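Some Python crawler exercises: bilibili user info crawling, a download tool, a Redis-based distributed crawler for new and second-hand homes on Fang.com (房天下), full-site article crawling of Jianshu, front-page news crawling of Guancha (观察者网), simulated Taobao login, crawling and visualization of Taobao search product info, crawling of Zhihu question answers, and watermark-free Douyin video download
☆150 · Jan 18, 2025 · Updated last year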
wuxh123 / iot_server
A backend IoT server built with epoll, MQTT, Redis, MySQL, and MongoDB.
☆20 · May 17, 2019 · Updated 7 years ago
featherL / web_file_manager
An online web-based file management tool
☆14 · Jul 24, 2024 · Updated 2 years ago
3ZY / Job_recommendation
Job recommendation system
☆24 · Aug 27, 2016 · Updated 9 years ago
OliverFoh / Housing-Recommender-System
A housing listing recommender system based on content similarity
☆12 · Jul 4, 2021 · Updated 5 years ago
YifanTian / Tap-News
A News Scraping and Recommendation System using React, Node.js, MongoDB, and TensorFlow.
☆10 · May 9, 2018 · Updated 8 years ago
Zessay / NLP-Pytorch-Template
A template for common NLP tasks
☆35 · Mar 24, 2023 · Updated 3 years ago
fztfztfzt / Generated-character-relation
Generates a character relationship file from a text and a dictionary of character names; Gephi can then be used to produce a network graph
☆15 · Aug 25, 2019 · Updated 6 years ago
liangyangtao / ubk_weixinbysogou
A program that collects WeChat official accounts via Sogou WeChat search
☆16 · Nov 12, 2015 · Updated 10 years ago