linlin0212/scrapy-selenium-SinaSpider

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/linlin0212/scrapy-selenium-SinaSpider)

linlin0212 / scrapy-selenium-SinaSpider

利用Scrapy+Selenium爬取新浪微博热点事件的博文与评论

☆40

Alternatives and similar repositories for scrapy-selenium-SinaSpider

Users that are interested in scrapy-selenium-SinaSpider are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

harrywang / scrapy-selenium-demo
View on GitHub
a demo of scrapy + selenium
☆19Sep 18, 2019Updated 6 years ago
keyucui / weibo_topic_analyze
View on GitHub
关注于某个大的话题，按关键字搜索总话题，分为各个分话题，在每个分话题下爬取多条热门微博及其评论数据，保证内容和评论的多样性
☆18Dec 22, 2020Updated 5 years ago
LakiLiu / Covid-19-Analysis
View on GitHub
数据集依据与“新冠肺炎”相关的230个主题关键词进行数据采集，抓取了2020年1月1日—2020年2月20日期间共计100万条微博数据，并对其中10万条数据进行人工标注，标注分为三类，分别为：1（积极），0（中性）和-1（消极）
☆18Dec 11, 2020Updated 5 years ago
egdw / SnowNLP_Movie
View on GitHub
基于SnowNLP的三百万电影数据的影评情感预测
☆12Jul 15, 2020Updated 6 years ago
KeluYao / Spider
View on GitHub
天猫商城、京东商城、汽车之家、新浪微博、百度贴吧、知乎、（百度、新浪、腾讯）明星库、中关村在线、360手机助手、应用宝等大型网站爬虫
☆20Jun 9, 2018Updated 8 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
justinchuntingho / songotsti
View on GitHub
A Package for Cantonese Tokenisation
☆18Jun 17, 2021Updated 5 years ago
frederichchen / ToySpiders
View on GitHub
自制Python玩具小爬虫，用来爬取失信被执行人、专利等数据
☆23Jul 31, 2020Updated 5 years ago
Ingram7 / WeiboSearch
View on GitHub
Scrapy 新浪微博搜索爬虫
☆17Aug 26, 2019Updated 6 years ago
Python3WebSpider / AjaxHookSpider
View on GitHub
Ajax Hook Demo
☆31Jun 1, 2020Updated 6 years ago
halfrost / docker_practice
View on GitHub
Learn and understand Docker technologies, with real DevOps practice!
☆19Dec 7, 2017Updated 8 years ago
saermart / WeiboClient
View on GitHub
集成微博数据采集、账户操作(发视频、发微博、发评论等)
☆41Jan 14, 2023Updated 3 years ago
CongSun-dlut / BioBERT-MRC
View on GitHub
Data and codes for BioBERT-MRC
☆11Oct 5, 2021Updated 4 years ago
skolo-online / django-image-gallery
View on GitHub
☆11Jul 25, 2024Updated 2 years ago
Randy-whiteSugar / LagouSpider_Scrapy
View on GitHub
使用Scrapy编写的拉勾网爬虫，添加了代理IP池、增量爬取机制
☆11May 22, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
liseami / ChunxiangClassCode
View on GitHub
纯想0基础全栈开发课程
☆10Dec 9, 2024Updated last year
backslash112 / book_scraper_python
View on GitHub
A demo to use the BeautifulSoup Python package to get the book informations from websites
☆15Oct 2, 2020Updated 5 years ago
fzyzcjy / ai_math_paper_list
View on GitHub
AI for Mathematics Paper List
☆17Jan 14, 2025Updated last year
joel-bentley / pinterest-clone
View on GitHub
A Pinterest clone built using Python and Flask
☆15Jan 27, 2017Updated 9 years ago
OSinoooO / MeituanSpider
View on GitHub
美团爬虫，基于scrapy_redis
☆22Apr 1, 2019Updated 7 years ago
mrhieu / ionic-cnn
View on GitHub
CNN News App on Ionic Framework (5)
☆12Jul 13, 2023Updated 3 years ago
cdalvaro / ruby-notebooks
View on GitHub
💎 A series of Jupyter notebooks for learning the Ruby programming language
☆11May 24, 2021Updated 5 years ago
alokVerma749 / Next-Blog-App
View on GitHub
This is a blogging website consisting of Admin support.
☆11Feb 27, 2023Updated 3 years ago
shikanon / proxy_scrapy
View on GitHub
proxy_scrapy是一个scrapy搭建的代理模块，主要包括代理抓取、代理测试和使用代理三个模块。包括了对主要的代理网站的抓取和代理稳定性的测试，并整合进scrapy爬虫当中。
☆10Jan 20, 2017Updated 9 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
NightCatSama / jQuery-Pinterest
View on GitHub
基于jQuery编写的瀑布流图片墙插件
☆10Apr 25, 2016Updated 10 years ago
aring1998 / human-benchmark
View on GitHub
Vue+Express开发的全栈Web项目，通过脑力游戏和认知测试来衡量您的各项能力
☆15Apr 30, 2026Updated 2 months ago
kuleafenu / customizable-web-crawler
View on GitHub
This web crawler can be customized to scrape almost all types of websites.
☆11Dec 31, 2021Updated 4 years ago
Day-Bright / caipanwenshu_spider
View on GitHub
selenium裁判文书网爬虫，文书网登录
☆41Jun 5, 2022Updated 4 years ago
uds-lsv / anea
View on GitHub
☆19Apr 28, 2021Updated 5 years ago
itversity / spark-sql
View on GitHub
Apache Spark using SQL
☆14Aug 18, 2021Updated 4 years ago
toyobayashi / mp-handle
View on GitHub
小程序版的汉兜
☆13Jan 29, 2024Updated 2 years ago
qiaolinwang / WHU_DB
View on GitHub
基于Pymysql和Pyqt的图书管理系统 Book Management System based on Pymysql and Pyqt
☆12Jan 13, 2023Updated 3 years ago
wlabatey / job_scraper
View on GitHub
A job scraper using the Scrapy framework
☆16Oct 20, 2017Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ewheeler / rapidsms-ussd-app
View on GitHub
RapidSMS app for using USSD services (check SIM balance, transfer airtime)
☆15Aug 5, 2010Updated 15 years ago
codeT / socialEngineer
View on GitHub
常用字典
☆15Mar 19, 2016Updated 10 years ago
saravanakumargn / RNComponentsExplorerSourceView
View on GitHub
☆14Jan 26, 2023Updated 3 years ago
twlite / node-xvdl
View on GitHub
🔞 Video downloader for xvideos.com written in pure JavaScript.
☆10May 26, 2021Updated 5 years ago
IdeaTry / vue-tetris
View on GitHub
俄罗斯方块
☆12Sep 13, 2016Updated 9 years ago
wlhost / SMS_Receive-server-client
View on GitHub
sms_receive flask open souce code.
☆12Oct 1, 2020Updated 5 years ago
rjadr / django-pdf-flipbook
View on GitHub
A Django app that displays pdf files in a grid and lets you read them as flipbooks
☆14Mar 3, 2026Updated 4 months ago