IshtarTang/weibo_spider-scrapy

IshtarTang / weibo_spider-scrapy

Weibo crawler that scrapes all posts published by a user and the comments under those posts, built on the Scrapy framework

☆38

Alternatives and similar repositories for weibo_spider-scrapy

Users who are interested in weibo_spider-scrapy are comparing it to the libraries listed below.

gyqlr / weibo_spider
Weibo crawler: scrapes Weibo corpus data, sentiment analysis, user-agent pool, ample IP pool, Scrapy, MongoDB
☆15 · Aug 23, 2018 · Updated 7 years ago
IshtarTang / weibo_spider
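Sina Weibo crawler that saves everything a user has posted, including original links, post text, comments, and more. (Weibo changed its data API along with its new UI, so this project no longer works; for a crawler targeting the new API, see weibo_spider-scrapy on the author's profile.)
☆20 · Nov 13, 2021 · Updated 4 years ago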
lopezbec / COVID19_Tweets_Dataset_2020
This dataset contains all the 2020 COVID-19 related data from the paper "An Augmented Multilingual Twitter Dataset for Studying the COVID…
☆11 · Jan 20, 2022 · Updated 4 years ago
Wenzhi-Ding / Weibo-Crawler
☆11 · Feb 26, 2023 · Updated 3 years ago
bloossoms / xiecheng_hotel_reviews_spider
Selenium-based scraper for Ctrip hotel reviews
☆13 · May 10, 2021 · Updated 5 years ago
shuai19980 / Email_cheat_L
Email spoofing plus bulk email sending, intended for phishing
☆17 · Jun 20, 2022 · Updated 4 years ago
duanxinyuan / cipher-utils
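Encryption and decryption utility library covering most symmetric encryption, asymmetric encryption, message digest algorithms, hash algorithms, and various encoding algorithms
☆18 · Dec 19, 2025 · Updated 7 months ago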
nju-kaoyan / nju_cs_22
☆13 · Apr 18, 2022 · Updated 4 years ago
zxins / hotfish
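Multithreaded crawler that fetches trending headlines from Zhihu, V2EX, Weibo, Tieba, ITHome, Douban, Hupu, Tianya, GitHub, and other sites, aggregated into a website with Flask.
☆34 · Feb 16, 2023 · Updated 3 years ago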
deathpooool / BUAA-961-xmind
Mind maps for specialized course 961 of the Beihang University (BUAA) postgraduate entrance exam (2021)
☆12 · Dec 27, 2021 · Updated 4 years ago
NiShuang / new_media_fans_cralwer
Crawler for Facebook, Weibo, Twitter, YouTube, and Youku data
☆22 · Sep 3, 2018 · Updated 7 years ago
XiangLinPro / ECommerceCrawlers
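Hands-on 🐍 crawlers 🕷 for a variety of websites and e-commerce data. Includes 🕸: Taobao products, WeChat official accounts, Dianping, Qichacha, job sites, Xianyu, Alibaba tasks, Cnblogs, Weibo, Baidu Tieba, Douban Movies, ibaotu, Quanjing, Douban Music, a provincial drug administration, Sohu News, machine-learning text collection, fofa asset collection, Autohome, National Bureau of Statistics, Baidu keyword index counts, spider…
☆28 · Aug 6, 2020 · Updated 5 years ago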
sasaju / NormalSchedule
HBU Timetable: a class schedule app for undergraduate and graduate students at Hebei University
☆16 · Dec 26, 2023 · Updated 2 years ago
aidenwang9867 / Weibo-User-Depression-Detection-Dataset
A published large-scale dataset - Weibo User Depression Detection Dataset.
☆110 · Updated this week
1171800312 / compiler_principles_hit_2020
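Course repository for Harbin Institute of Technology (HIT) Compiler Principles (Compiler Systems), spring 2020, including three labs and the final exam paper. Overall, the final exam was fairly easy (the harder material, such as data-flow analysis, was not tested), while the labs were very difficult; implementing them all yourself will certainly be rewarding.
☆20 · Sep 14, 2020 · Updated 5 years ago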
thunlp / COVID19-CountryImage
Dataset of China's-image-related tweets during COVID-19 with aspect-level sentiment labels.
☆17 · Feb 2, 2021 · Updated 5 years ago
jin-taiyu / Bnalyser
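Built on the advanced BERT model, this project aims to tackle the challenge of personalized information analysis on social media. As social media data grows explosively, we use BERT's semantic understanding to provide sentiment analysis and text classification. The platform can help individuals, businesses, and government agencies better understand user needs, and provides personalized recommendations and decision support.
☆20 · Jan 16, 2026 · Updated 6 months ago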
BarryYin / chat-on-wechat
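WeChat chatbot built on large language models, with support for WeChat, WeCom, WeChat official accounts, Feishu, and DingTalk. Offers a choice of GPT3.5/GPT4.0/Claude/ERNIE Bot/iFlytek Spark/Tongyi Qianwen/Gemini/GLM-4/LinkAI; handles text, voice, and images; can access the operating system and the internet; supports customization based on your own knowledge base for enterprise…
☆15 · Mar 11, 2024 · Updated 2 years ago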
iAbdullahMughal / espionage
A basic Python-based tool for domain ℹ️ information gathering. I am working 💻 on collecting information related to domain whois, history…
☆13 · Jan 11, 2026 · Updated 6 months ago
AnsGoo / djangoMultiTenant
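Django multi-tenant implementation that isolates tenant data at the database level; compatible with django.auth, admin, migrate, and other modules; supports rest_framework and all databases that Django supports by default
☆16 · Jan 8, 2023 · Updated 3 years ago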
fxyz999 / cnfunds
Comprehensive collection of fund information
☆14 · Apr 6, 2025 · Updated last year
9ayhub / simple-search-engine
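Small web app built with Flask and Bootstrap, implementing full-text search (spell checking and correction, inverted index, TF-IDF document ranking) and article browsing (article summary, read original)
☆16 · Dec 8, 2022 · Updated 3 years ago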
LeonWang91 / Google-Spyder
Scrapes Google search results for keyword queries
☆16 · Aug 25, 2022 · Updated 3 years ago
marwincn / pubsenti-finder
Weibo comment sentiment analysis, crawler, text classification, web.
☆45 · Nov 13, 2025 · Updated 8 months ago
Mcliuyi / Light-Short-text-product-classification
Scrapy crawlers for Taobao, JD.com, and Suning
☆10 · Dec 8, 2022 · Updated 3 years ago
hlpureboy / PYphishing-Email
Python email phishing
☆40 · Mar 2, 2018 · Updated 8 years ago
CPJ31415 / -AI
Gomoku human-vs-computer play: minimax, pruning, heuristic search
☆10 · Nov 7, 2020 · Updated 5 years ago
Yiuman / quaScrapy
Qunar crawler (scenic spots and scenic spot reviews)
☆10 · Jul 1, 2019 · Updated 7 years ago
1414044032 / Sina_Spider
Sina crawler based on Python and Selenium. Saves cookies after a simulated login to persist the logged-in state. Can scrape trending Weibo posts related to an entered keyword.
☆30 · Aug 21, 2018 · Updated 7 years ago
ctts / TopSearch
Node.js-based crawler for trending lists from Weibo, Baidu Hot Search, Zhihu Daily, bilibili, and others
☆27 · Dec 22, 2022 · Updated 3 years ago
YushengAuggie / Tianyancha
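Quick, foolproof crawler script for Tianyancha. Enter a target company's approximate name or abbreviation to save its business registration information, sorted by category, as Excel files.
☆22 · May 23, 2018 · Updated 8 years ago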
Cheereus / WeiboEmotionAnalyzer
Scrapes Weibo content by keyword and performs sentiment analysis
☆16 · Mar 18, 2020 · Updated 6 years ago
hlwgy / juejin_rpa
RPA + AI: solving slider CAPTCHAs using image recognition
☆23 · Aug 22, 2023 · Updated 2 years ago
I4Can / ShoppingWebsiteCrawlwer
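Keyword-based, configurable e-commerce crawler; JD.com and Suning are implemented so far (Taobao's anti-scraping measures are too strict, since Selenium is not used)
☆12 · Jun 3, 2020 · Updated 6 years ago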
kingabzpro / FastAPI-ML-Project
Learning and building an API using FastAPI
☆16 · Aug 7, 2021 · Updated 4 years ago
hahaha108 / JDSpider
JD.com crawler that automatically scrapes related product information for an entered keyword; can also be customized to scrape product reviews.
☆11 · Mar 23, 2018 · Updated 8 years ago
Henryhaohao / Xiecheng_Comment
Xiecheng_Comment: multithreaded (threading) scraper for Ctrip reviews of the Lijiang Old Town attraction, with word cloud generation
☆25 · Oct 20, 2018 · Updated 7 years ago
HIT-UOI-SR / HIT-Graduate-Report
LaTeX template for Harbin Institute of Technology graduate reports
☆11 · Jul 24, 2021 · Updated 5 years ago
keleqnma / flask-vuejs-nlp
Flask + Vue final course project: a crawler-backed novel website with NLP text analysis features
☆14 · Jan 4, 2023 · Updated 3 years ago