xianhu/PSpider

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xianhu/PSpider)

xianhu / PSpider

简单易用的Python爬虫框架，QQ交流群：597510560

☆1,837

Alternatives and similar repositories for PSpider

Users that are interested in PSpider are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

howie6879 / ruia
View on GitHub
Async Python 3.6+ web scraping micro-framework based on asyncio
☆1,739Jul 1, 2023Updated 3 years ago
xianhu / LearnPython
View on GitHub
以撸代码的形式学习Python
☆8,507Jan 5, 2025Updated last year
xchaoinfo / fuck-login
View on GitHub
模拟登录一些知名的网站，为了方便爬取需要登录的网站
☆5,868Jun 8, 2018Updated 8 years ago
SpiderClub / weibospider
View on GitHub
A distributed crawler for weibo, building with celery and requests.
☆4,794Jul 11, 2020Updated 6 years ago
chyroc / WechatSogou
View on GitHub
基于搜狗微信搜索的微信公众号爬虫接口
☆6,339Mar 7, 2026Updated 4 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
jhao104 / proxy_pool
View on GitHub
Python ProxyPool for web spider
☆23,488Jun 15, 2026Updated last month
luyishisi / Anti-Anti-Spider
View on GitHub
越来越多的网站具有反爬虫特性，有的用图片隐藏关键数据，有的使用反人类的验证码，建立反反爬虫的代码仓库，通过与不同特性的网站做斗争（无恶意）提高技术。（欢迎提交难以采集的网站）（因工作原因，项目暂停）
☆7,286Oct 17, 2021Updated 4 years ago
lining0806 / PythonSpiderNotes
View on GitHub
Python入门网络爬虫之精华版
☆7,455Jun 21, 2021Updated 5 years ago
Kr1s77 / awesome-python-login-model
View on GitHub
😮python模拟登陆一些大型网站，还有一些简单的爬虫，希望对你们有所帮助❤️，如果喜欢记得给个star哦🌟
☆16,230Jul 26, 2022Updated 3 years ago
binux / pyspider
View on GitHub
A Powerful Spider(Web Crawler) System in Python.
☆16,802Apr 30, 2024Updated 2 years ago
qiyeboy / IPProxyPool
View on GitHub
IPProxyPool代理池项目，提供代理ip
☆4,279Jul 13, 2018Updated 8 years ago
my8100 / scrapydweb
View on GitHub
Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI.…
☆3,410Feb 19, 2025Updated last year
gnemoug / distribute_crawler
View on GitHub
使用scrapy,redis, mongodb,graphite实现的一个分布式网络爬虫,底层存储mongodb集群,分布式使用redis实现,爬虫状态显示使用graphite实现
☆3,243Apr 18, 2017Updated 9 years ago
SpiderClub / haipproxy
View on GitHub
High available distributed ip proxy pool, powerd by Scrapy and Redis
☆5,535Dec 26, 2022Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
chrislinan / cx-extractor-python
View on GitHub
基于行块分布函数的通用网页正文抽取算法的Python版本实现，添加了英文支持/ Web page content extraction algorithm, support both Chinese and English
☆482Jul 9, 2019Updated 7 years ago
LeetaoGoooo / MovieHeavens
View on GitHub
🎬 基于Pyqt5的简单电影搜索工具
☆653Oct 11, 2022Updated 3 years ago
hangsz / pandas-tutorial
View on GitHub
适合初级到中级晋升者，有了体系之后就看熟练度了。
☆1,891Mar 30, 2024Updated 2 years ago
jukanntenn / django-blog-tutorial
View on GitHub
基于 Python3.5 和 Django 1.10 的 Django Blog 项目。
☆2,358Jun 10, 2021Updated 5 years ago
littlecodersh / ItChat
View on GitHub
A complete and graceful API for Wechat. 微信个人号接口、微信机器人及命令行微信，三十行即可自定义个人号机器人。
☆26,478Sep 28, 2023Updated 2 years ago
MikeChongCan / scylla
View on GitHub
Intelligent proxy pool for Humans™ to extract content from the internet and build your own Large Language Models in this new AI era
☆4,018Jun 9, 2025Updated last year
LiuXingMing / SinaSpider
View on GitHub
新浪微博爬虫（Scrapy、Redis）
☆3,284Sep 5, 2018Updated 7 years ago
jobbole / awesome-python-cn
View on GitHub
Python资源大全中文版，包括：Web框架、网络爬虫、模板引擎、数据库、数据可视化、图片处理等，由「开源前哨」和「Python开发者」微信公号团队维护更新。
☆30,491Aug 29, 2022Updated 3 years ago
aosabook / 500lines
View on GitHub
500 Lines or Less
☆29,578Aug 19, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
flaggo / pydu
View on GitHub
Useful data structures and utils for Python.
☆338Jun 17, 2026Updated last month
qiwsir / algorithm
View on GitHub
☆3,301Jun 8, 2022Updated 4 years ago
jtyoui / PyUnit
View on GitHub
搜狗词库下载、新词发现算法、常见的工具类、百度应用、翻译、天气预报、汉语纠错、字符串文本数据提取时间解析、百度文库下载、实体抽取等等
☆726Mar 24, 2022Updated 4 years ago
soimort / you-get
View on GitHub
Dumb downloader that scrapes the web
☆56,852Jul 6, 2026Updated last week
elliotgao2 / toapi
View on GitHub
Every web site provides APIs.
☆3,555Jul 10, 2026Updated last week
bowenpay / wechat-spider
View on GitHub
微信公众号爬虫
☆3,359Aug 10, 2021Updated 4 years ago
FullerHua / gooseeker
View on GitHub
☆694Oct 26, 2016Updated 9 years ago
GeneralNewsExtractor / GeneralNewsExtractor
View on GitHub
新闻网页正文通用抽取器 Beta 版.
☆3,788Apr 21, 2026Updated 2 months ago
piglei / one-python-craftsman
View on GitHub
来自一位 Pythonista 的编程经验分享，内容涵盖编码技巧、最佳实践与思维模式等方面。
☆7,209May 16, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
rmax / scrapy-redis
View on GitHub
Redis-based components for Scrapy.
☆5,643May 19, 2026Updated last month
fake-useragent / fake-useragent
View on GitHub
Up-to-date simple useragent faker with real world database
☆4,056Mar 29, 2026Updated 3 months ago
fate0 / getproxy
View on GitHub
getproxy 是一个抓取发放代理网站，获取 http/https 代理的程序
☆830Aug 2, 2022Updated 3 years ago
da2vin / Sasila
View on GitHub
一个灵活、友好的爬虫框架
☆296Jul 6, 2022Updated 4 years ago
awolfly9 / IPProxyTool
View on GitHub
python ip proxy tool scrapy crawl. 抓取大量免费代理 ip，提取有效 ip 使用
☆2,000Dec 8, 2022Updated 3 years ago
lanbing510 / DouBanSpider
View on GitHub
豆瓣读书的爬虫
☆2,788Apr 8, 2020Updated 6 years ago
crawlab-team / crawlab
View on GitHub
Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台，支持任何语言和框架
☆12,246Feb 10, 2026Updated 5 months ago