yinzishao/NewsScrapy

yinzishao / NewsScrapy

A Scrapy-based news crawler

☆101

Alternatives and similar repositories for NewsScrapy

Users who are interested in NewsScrapy are comparing it to the libraries listed below.

hailong0707-zz / spider_news_all
Scrapy spiders for various news websites
☆109 · Sep 3, 2015 · Updated 10 years ago
Harhao / toutiao
Crawler for the Toutiao (今日头条) technology news API
☆17 · Sep 26, 2017 · Updated 8 years ago
jasonren0403 / news_hotspot_crawler
Scrapy-based content crawler for major domestic news websites in China
☆26 · Feb 12, 2022 · Updated 4 years ago
build2last / NCspider
A Scrapy project that scrapes news and comments from Chinese web portals (maintenance resumed)
☆14 · Dec 26, 2022 · Updated 3 years ago
Felix-P-Code / scrapyweixi
WeChat scraper built with Scrapy + Selenium + PhantomJS; CAPTCHAs it encounters are sent to a CAPTCHA-solving platform
☆11 · Feb 2, 2017 · Updated 9 years ago
Jacen789 / NewsCrawler
News crawler that scrapes real-time financial news from Sina, Sohu, and Xinhuanet.
☆196 · May 9, 2020 · Updated 6 years ago
pyorc / pyorcnews
A news crawler based on the Scrapy framework
☆11 · Jan 13, 2016 · Updated 10 years ago
sunshinenum / sina_news
Scrapy-based crawler for Sina News; uses MySQL and MongoDB as databases and includes a Docker image of the master branch.
☆18 · Aug 9, 2016 · Updated 9 years ago
tankle / newscrawler
News website crawler; currently scrapes news pages from sites including NetEase, Sina, QQ, and Sohu, and saves them locally.
☆34 · Jun 12, 2015 · Updated 11 years ago
wuchong / scrapy-dynamic-configurable
A dynamically configurable news crawler based on Scrapy
☆164 · Jul 24, 2017 · Updated 9 years ago
boss-mao / scrapy_enterprise_architecture
Architecture template for developing enterprise-grade distributed crawlers with Python and Scrapy
☆96 · Mar 1, 2018 · Updated 8 years ago
FrankXiong / cqunews-web
A news website built from data scraped from the Chongqing University news site with a Java web crawler
☆11 · Mar 7, 2016 · Updated 10 years ago
lzjqsdd / NewsSpider
Scrapes news from Toutiao, NetEase, Tencent, and others, and builds a simple search engine
☆637 · May 14, 2024 · Updated 2 years ago
Google1234 / Information_retrieva_Projectl-
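News retrieval: a crawler performs targeted collection of 3-4 web pages and implements extraction, retrieval, and indexing of page content. The number of web pages is no fewer than 10; results can be sorted by time, relevance, popularity, and other attributes, and similar topics are clustered automatically. Supported features: related-search suggestions, snippet generation, and result previews (hovering over a result shows a preview)
☆129 · Aug 2, 2016 · Updated 9 years ago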
lxf44944 / sinaNews_crawler
Sina News crawler
☆15 · Feb 14, 2015 · Updated 11 years ago
hk029 / LagouSpider
[Illustrated guide] Scrapy crawlers and dynamic pages: scraping job listings from Lagou (1)
☆82 · Jun 2, 2016 · Updated 10 years ago
JetFeng / SohuSpider-Java
Sohu News crawler written in Java
☆14 · May 2, 2017 · Updated 9 years ago
Flowingsun007 / house_spider
Lianjia house spider: crawler for Lianjia (链家) second-hand housing listings. Spring Boot + WebMagic + MySQL + Redis
☆27 · Apr 22, 2021 · Updated 5 years ago
liriansu-opus / wescraper
Scrapes WeChat official account articles using Scrapy and Sogou Search
☆49 · Mar 25, 2017 · Updated 9 years ago
Ingram7 / NewsinaSpider
Scrapy crawler for Sina News
☆12 · Aug 26, 2019 · Updated 6 years ago
sph116 / zhongxin_search
Crawler for China News (中国新闻网): site-wide incremental crawler, working until July 2019
☆17 · Jul 13, 2019 · Updated 7 years ago
littleVege / pixiv_crawl
Scrapy-based crawler for the Pixiv popularity rankings
☆80 · Aug 25, 2016 · Updated 9 years ago
cyhleo / JinRiTouTiaoNews
Scrapy + pyppeteer; scrapes news and popular comments from Toutiao.
☆12 · May 6, 2020 · Updated 6 years ago
rama291041610 / TongHuaShun-Spider
A crawler for financial news from Tonghuashun (同花顺).
☆16 · Apr 12, 2019 · Updated 7 years ago
zhangslob / awesome_crawl
Tencent News, Zhihu topics, Weibo followers, a Tumblr crawler, Douyu bullet comments (danmaku), a Meizitu crawler, distributed design, and more
☆303 · Jun 6, 2025 · Updated last year
terry2tan / weiboCAR
[Crawler] Scrapy-based Weibo crawler (comments, reposts, likes) that supports batch scraping.
☆29 · Dec 1, 2016 · Updated 9 years ago
brady-chen / tbNews
Incremental focused crawler for financial news
☆21 · Jul 17, 2017 · Updated 9 years ago
netxfly / dianying
Crawler for mainstream Chinese online movie sites, plus web code for search
☆35 · Jun 9, 2014 · Updated 12 years ago
KFPA / ScrapyNews
A project that scrapes news using the Scrapy framework
☆10 · Jun 8, 2018 · Updated 8 years ago
lihansunbai / Fang_Scrapy
A crawler from the author's graduation project; scrapes housing price and transaction data from 58.com (58同城), Ganji, Lianjia, Anjuke, and 5i5j (我爱我家).
☆330 · May 4, 2016 · Updated 10 years ago
yanceyblog / scrapy-imdb
A Scrapy crawler that scrapes all film and TV data from imdb.cn
☆12 · Dec 27, 2016 · Updated 9 years ago
lawlite19 / CrawlPicture_Scrapy
Uses the Scrapy framework to scrape images from web pages and save them locally
☆14 · Sep 11, 2016 · Updated 9 years ago
orangeMask / spider
Crawlers for Douyin, Taobao-family sites, and common news sites
☆13 · Apr 15, 2022 · Updated 4 years ago
BillBillBillBill / NewsCrawler
Graduation project: design and implementation of a web-crawler-based news collection and subscription system
☆84 · May 26, 2017 · Updated 9 years ago
jacoblai / Coolpy5App
http://icoolpy.com
☆10 · Sep 10, 2016 · Updated 9 years ago
crystal-tensor / spide
A web crawler that mainly scrapes stock data, foreign exchange data, stock background information, and timely stock news
☆13 · Aug 13, 2018 · Updated 7 years ago
NanoNets / DocAIAgent
This code is part of a workshop conducted on how to build your own Document AI Agent using Open Source LLMs
☆16 · May 8, 2025 · Updated last year
striver-ing / internet-content-detection
A crawler framework written in Python, plus scrapers for specific websites
☆18 · Oct 24, 2017 · Updated 8 years ago
kba977 / Scrapy_Projects
🕷 Some practice projects for Scrapy crawlers
☆76 · Apr 30, 2019 · Updated 7 years ago