backto17/SinaHouseCrawler

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/backto17/SinaHouseCrawler)

backto17 / SinaHouseCrawler

基于scrapy,scrapy-redis实现的一个分布式网络爬虫,爬取了新浪房产的楼盘信息及户型图片,实现了常用的爬虫功能需求.

☆40

Alternatives and similar repositories for SinaHouseCrawler

Users that are interested in SinaHouseCrawler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

asen477 / scrapy
View on GitHub
📚Scrapy：网站爬虫框架库
☆12Aug 15, 2020Updated 5 years ago
phospher / WeiboSpider
View on GitHub
A Spider for grapping weibo text from weibo(Sina, Tencent and so on)
☆21Oct 25, 2013Updated 12 years ago
sunshinenum / sina_news
View on GitHub
基于Scrapy的爬虫，爬取新浪新闻，数据库使用mysql和mongoDB附带master分支docker镜像。
☆18Aug 9, 2016Updated 9 years ago
build2last / NCspider
View on GitHub
A Scrapy Project 中文门户网站新闻和评论抓取——重启维护工作
☆14Dec 26, 2022Updated 3 years ago
C1tas / DiscuzX3.2_SSRF_EXP
View on GitHub
☆11Jun 25, 2016Updated 10 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
lawlite19 / CrawlPicture_Scrapy
View on GitHub
使用Scrapy爬虫框架爬取网页图片并保存本地
☆14Sep 11, 2016Updated 9 years ago
alexayan / weibo_repost_python
View on GitHub
抓取微博转发关系数据，weibo repost
☆10Nov 16, 2015Updated 10 years ago
conwaywang / jiemeibang
View on GitHub
python实现采集数据并发表到论坛中。涉及数据的爬取分析，discuz论坛的登录、发帖及回复等
☆40Jan 2, 2014Updated 12 years ago
sanand0 / benchmarks
View on GitHub
Various benchmark tests
☆13May 10, 2015Updated 11 years ago
newliver / tencentyun-porndetect
View on GitHub
万象优图智能鉴黄Python SDK（非官方）
☆13Nov 24, 2015Updated 10 years ago
Harhao / toutiao
View on GitHub
今日头条科技新闻接口爬虫
☆17Sep 26, 2017Updated 8 years ago
KDF5000 / RSpider
View on GitHub
一个基于scrapy-redis的分布式爬虫模板
☆43Jul 4, 2017Updated 9 years ago
xurenlu / hyer
View on GitHub
vertical search crawler
☆38Jan 9, 2012Updated 14 years ago
lengyingzi / markdown-haed-number
View on GitHub
VScode 插件，标题自动增加序号
☆12Mar 3, 2019Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
wsb200514 / 34miao_web
View on GitHub
业余时间用Django开发了网站三四秒，这是源码
☆21Jun 7, 2018Updated 8 years ago
Chengel-HaltuD / Android-WeiXin-QiangHongBao
View on GitHub
微信抢红包外挂
☆12Jul 19, 2016Updated 10 years ago
KeithYue / weibo-keywords-crawler
View on GitHub
Crawl the related sina weibo content using the keywords, and save the results to txt file for future use.
☆18Oct 20, 2016Updated 9 years ago
Ncerzzk / weibo
View on GitHub
login weibo
☆18Feb 24, 2015Updated 11 years ago
HaiQW / webspiders
View on GitHub
基于Scrapy的网络（微薄and知乎)爬虫(A weibo spider written in Scrapy)
☆16Apr 19, 2016Updated 10 years ago
shikanon / proxy_scrapy
View on GitHub
proxy_scrapy是一个scrapy搭建的代理模块，主要包括代理抓取、代理测试和使用代理三个模块。包括了对主要的代理网站的抓取和代理稳定性的测试，并整合进scrapy爬虫当中。
☆10Jan 20, 2017Updated 9 years ago
KDF5000 / SpiderRef
View on GitHub
爬虫资料汇总
☆17Dec 5, 2015Updated 10 years ago
MOON-CLJ / scrapy_weibo
View on GitHub
distributed crawler for weibo
☆22May 23, 2013Updated 13 years ago
hzq1995 / ICAR
View on GitHub
BILIBILI.
☆15Jan 6, 2019Updated 7 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
jiehua233 / ipproxy
View on GitHub
代理IP提取工具
☆115Sep 7, 2017Updated 8 years ago
wwj718 / jobSpider
View on GitHub
jobSpider是一只scrapy爬虫，用于爬取职位信息
☆28Aug 14, 2016Updated 9 years ago
dagege1993 / scrapy
View on GitHub
1,huaproject算福利吧,爬取的中国校花网,并且保存到本地,基础知识点,url,json,文件的读写. 2,Document.doc 是自己总结的常见爬虫面试题以及答案,但是貌似不想做全职爬虫,所以可能以后也不会更新这一块,爬虫算乐趣, 以后估计重心会放在web …
☆14Jan 24, 2018Updated 8 years ago
qqxx6661 / flask_yzd
View on GitHub
旧版某东监控网站前后端，轻量级Flask网站，可用作学习Flask
☆74Feb 15, 2023Updated 3 years ago
gpxlcj / weibospider
View on GitHub
A Web Spider for Weibo(Chinese Twitter)
☆18Aug 12, 2015Updated 10 years ago
understar / Stock-Science
View on GitHub
研究一下大数据支撑下的股票科学
☆12Oct 12, 2015Updated 10 years ago
burkun / Sinaweibo
View on GitHub
新浪微博模拟登录和自动发微博，带图片微博的python脚本，使用opencv实现读取摄像头上传图片到微博。
☆21Feb 27, 2018Updated 8 years ago
wuyue92tree / crwy
View on GitHub
一个简单的web爬虫框架，借鉴scrapy结构开发而来，并为scrapy使用者提供通用轮子^.^
☆13Nov 9, 2020Updated 5 years ago
gudaoxuri / keyword-extract
View on GitHub
简单高效的URL关键词提取工具
☆15Nov 13, 2018Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Nikolai10 / mobile-ocr
View on GitHub
Camera-based Document Analysis
☆26Jul 7, 2025Updated last year
PyCN / -scrapy-
View on GitHub
仿造scrapy制作轻量级爬虫框架，旨在提升编程能力
☆20Jan 29, 2017Updated 9 years ago
wanghuafeng / baidu_spider
View on GitHub
百度爬虫：热词，词频，音乐，poi信息
☆21Mar 10, 2015Updated 11 years ago
TeamHG-Memex / tor-proxy
View on GitHub
a tor socks proxy docker image
☆12Apr 8, 2026Updated 3 months ago
imflyn / decoration-design-crawler
View on GitHub
土巴兔和谷居装修网站爬虫
☆108Jul 26, 2019Updated 7 years ago
Flowerowl / pylinktester
View on GitHub
A multi-thread website link detector
☆22Feb 8, 2014Updated 12 years ago
aschmahmann / dht-graph
View on GitHub
A simple libp2p DHT crawler
☆16Jan 6, 2022Updated 4 years ago