本爬虫用于爬取知乎网站问题、回答的相关字段信息,问题的标题、内容、发布时间、话题、回答数量、评论数、点击数、关注数等字段,及对该问题回答的内容,作者、点赞数、评论数、回答时间等等字段信息。可用于对社会话题、热点进行数据分析。
☆42Nov 30, 2018Updated 7 years ago
Alternatives and similar repositories for zhihuSpider
Users that are interested in zhihuSpider are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 知乎爬虫,可以爬取知乎上特定问题下的所有回答、单个回答,特定用户的所有回答、文章,话题精华,收藏夹,专栏,文章☆73Sep 27, 2019Updated 6 years ago
- GOAT(山羊)是中英文大语言模型,基于LlaMa进行SFT。☆12Apr 24, 2023Updated 2 years ago
- ☆10Dec 3, 2020Updated 5 years ago
- 基于哔哩哔哩用户评论的文本情感分析☆14Sep 2, 2023Updated 2 years ago
- 抖音爬虫——采集账号主页、喜欢、收藏、音乐原声、搜索、关注、粉丝、合集、单作品。支持抖音号查询信息(精确粉丝数)。支持搭建API。接口版:post分支☆23Jul 28, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Homework exercises from the "Understanding Cryptography" textbook and accompanying lecture series.☆27Mar 25, 2018Updated 8 years ago
- Data and code for the book Enumerations: Data and Literary Study (Chicago 2018)☆26Dec 2, 2018Updated 7 years ago
- 2020 阿里云天池大数据竞赛-中医药文献问题生成挑战赛☆30Sep 2, 2021Updated 4 years ago
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆29May 13, 2020Updated 5 years ago
- 爬取知乎个人主页的想法、文篇和回答☆66May 1, 2025Updated 10 months ago
- extract the time domain or frequent domain features from wav format audio☆34Oct 3, 2019Updated 6 years ago
- Implementations of various sentiment analysis methods in Python.☆33Nov 10, 2017Updated 8 years ago
- An NLP processing pipeline for characters in fanfiction. Developed by students at Carnegie Mellon University from 2019-2021.☆34Feb 2, 2026Updated last month
- Emotion detection on multiparty dialogue.☆40Apr 13, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 阿里天池AI安全挑战第一期人脸识别攻击☆10Jun 26, 2020Updated 5 years ago
- 本项目是一个微博爬虫项目,旨在通过微博的mid获取到其对应的所有点赞、转发、评论与二级评论的相关数据。☆57Oct 14, 2022Updated 3 years ago
- 识别网站cms指纹☆12May 19, 2019Updated 6 years ago
- 自写爬虫爬取知乎问题及回答☆39Jun 10, 2019Updated 6 years ago
- 基于BERT模型的中文文本情感分类☆40Oct 29, 2022Updated 3 years ago
- 基于appium的app自动遍历工具☆11May 8, 2019Updated 6 years ago
- Repository for Quantifying Valence and Arousal in Text with Multilingual Pre-trained Transformers☆41Feb 26, 2023Updated 3 years ago
- “谛听”(discern)资产识别分析平台,一个简化版的物联网设备信息安全搜索引擎,IOT—Scanner的迭代优化版本。目前集成了主机发现、端口扫描、设备识别、漏洞匹配、poc验证等功能。☆17Feb 6, 2021Updated 5 years ago
- Text adventure game engine, supports script block collaboration, will be convenient for social friends to relay creation 为进化社游戏项目写的文字冒险游戏…☆11Jul 23, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 开源QG系统(Question Generation,问题生成),基于Pytorch和Transformer编写☆55Jul 25, 2024Updated last year
- 基于mongodb存储,redis缓存,celery 实现的分布式爬虫。☆13Dec 7, 2022Updated 3 years ago
- Anti-hacking tools deploying configuration for Wordpress☆13May 27, 2020Updated 5 years ago
- A simple yaml-based xpath crawler framework for easy tracking site updates. https://zhupeng.github.io/☆21Mar 1, 2024Updated 2 years ago
- 生死簿管理系统☆11Jun 23, 2019Updated 6 years ago
- 【🔞这个项目废弃了,主要迁移到autocronjob项目,欢迎大家去使用】dev_task任务管理平台,实现了类似crontab定时执行任务的功能,包括任务结果的保存,展示。任务启动,禁用,等编辑,可多节点部署,随意水平扩展。☆14Aug 14, 2019Updated 6 years ago
- 爬取CNVD,CNNVD,中国工控网,以及对于工控网站的选取分析☆18Jan 8, 2018Updated 8 years ago
- Web安全-XSS的攻击和防范☆18Sep 2, 2022Updated 3 years ago
- ☆13May 25, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- 🖼📷【名称】:🆒优爱酷批量长网页整页截图系统(原创)【软件功能概述】:💻仿人工操作智能全自动滚动屏幕滚动条并保存截图 【输出图片格式】:png,gif,bmp,jpg,PDF 【批量截图方式】:📝txt批量,❎Excel批量,👣步进批量,🎯模拟点击,📅定时计…☆13Sep 2, 2022Updated 3 years ago
- 本项目为企业工商信息网络爬虫,输入行业关键字,例如“铜箔”,可爬取八方资源网等工商信息网上所有与铜箔有关企业的工商信息。☆24Jul 5, 2018Updated 7 years ago
- Big Data Analysis of Tinder done at Universitat Rovira i Virgili and Universitat Politècnica de Catalunya · BarcelonaTech☆14Jan 3, 2023Updated 3 years ago
- YouTube Assistant☆12May 15, 2023Updated 2 years ago
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io☆39Mar 4, 2026Updated 3 weeks ago
- 🌏实时监控900多家中国企业的新闻动态☆23Oct 10, 2017Updated 8 years ago
- An implementation of MSSRM method☆11Mar 23, 2023Updated 3 years ago