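This crawler scrapes field information for questions and answers on Zhihu. For each question it collects the title, content, publication time, topics, answer count, comment count, click count, and follower count; for each answer to that question it collects the content, author, upvote count, comment count, and answer time. The data can be used to analyze social topics and trending issues.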
☆43Nov 30, 2018Updated 7 years ago
Alternatives and similar repositories for zhihuSpider
Users that are interested in zhihuSpider are comparing it to the libraries listed below.
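- Zhihu crawler that can scrape all answers or a single answer under a specific question, all answers and articles by a specific user, topic highlights, favorites collections, columns, and articles☆78Sep 27, 2019Updated 6 years ago
- GOAT (山羊) is a Chinese-English large language model, built with SFT on top of LLaMA.☆12Apr 24, 2023Updated 3 years ago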
- ☆10Dec 3, 2020Updated 5 years ago
- A VS Code extension that displays the remaining 88code credits and the Codex balance☆18Oct 25, 2025Updated 6 months ago
- ☆12Apr 9, 2024Updated 2 years ago
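- Douyin crawler: collects account home pages, likes, favorites, original sounds, search results, following, followers, collections, and single posts. Supports looking up information by Douyin ID (exact follower count). Supports setting up an API. API version: post branch☆23Jul 28, 2023Updated 2 years ago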
- Encyclopedic Hub for Sentiment Dictionaries☆15Nov 20, 2025Updated 5 months ago
- This repo contains the solutions of UC Berkeley CS 61B spring semester 2018, and materials including slides, lecture codes, exams and dis…☆15May 24, 2024Updated last year
- Homework exercises from the "Understanding Cryptography" textbook and accompanying lecture series.☆27Mar 25, 2018Updated 8 years ago
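- A simple crawler for scraping 淘女郎 (Tao Girl) images, accompanying the blog post [Python crawler beginner tutorial (3): Tao Girl crawler (API analysis | image download)](https://blog.csdn.net/aaronjny/article/details/80291997).☆11May 13, 2018Updated 7 years ago
- A simple keyword-based Scrapy crawler for Xinhuanet and People's Daily Online☆12Jun 2, 2022Updated 3 years ago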
- Data and code for the book Enumerations: Data and Literary Study (Chicago 2018)☆26Dec 2, 2018Updated 7 years ago
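- Functionality: given a newly entered text, compares its similarity against the existing data and returns the top 10 most similar texts. Main methods: jieba Chinese word segmentation, gensim, TF-IDF term weighting, and cosine similarity.☆11Jul 30, 2020Updated 5 years ago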
- Scrapes TFRs from FAA site☆21Oct 2, 2024Updated last year
- Predict human emotions in tweets by mapping emojis into the Valence-Arousal space (Russell, 2005). LSTM models the sequence learning of w…☆24Dec 8, 2022Updated 3 years ago
- Sharable scripts and stylesheets from the Northeastern University Women Writers Project☆24Apr 7, 2026Updated last month
- flightradar24 GUI client built with Python☆16Nov 17, 2018Updated 7 years ago
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆29May 13, 2020Updated 5 years ago
- Sentence embedding using Smooth Inverse Frequency weighting scheme☆15Feb 21, 2020Updated 6 years ago
- A distributed movie search engine built with Scrapy + Elasticsearch + Django☆31Jul 25, 2018Updated 7 years ago
- WSDM'2022: Knowledge Enhanced Sports Game Summarization☆18Jun 16, 2022Updated 3 years ago
- Node.js app to watch files and directories then sync them to the remote server using rsync☆22Apr 8, 2026Updated last month
- ☆19Jan 7, 2018Updated 8 years ago
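- Python course assignment: a crawler that scrapes Douban book information☆21May 17, 2020Updated 5 years ago
- Brute-force detection of QQ enterprise mailbox users with weak passwords, in order to remind them to change their passwords☆13Nov 25, 2015Updated 10 years ago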
- Implementations of various sentiment analysis methods in Python.☆33Nov 10, 2017Updated 8 years ago
- An NLP processing pipeline for characters in fanfiction. Developed by students at Carnegie Mellon University from 2019-2021.☆35Feb 2, 2026Updated 3 months ago
- Identifies website CMS fingerprints☆12May 19, 2019Updated 6 years ago
- A self-written crawler that scrapes Zhihu questions and answers☆39Jun 10, 2019Updated 6 years ago
- Unsupervised text segmentation based on Latent Dirichlet Allocation and Topic Tiling☆24Aug 6, 2016Updated 9 years ago
- A list of ethics related resources for researchers and practitioners of Natural Language Processing and Computational Linguistics☆34Oct 20, 2025Updated 6 months ago
- Search algorithms based on Pacman☆16Oct 3, 2017Updated 8 years ago
- Using LLM chat with knowledge☆21Aug 12, 2023Updated 2 years ago
- Chinese text sentiment classification based on the BERT model☆40Oct 29, 2022Updated 3 years ago
- Adds the ability to send CarrierWave uploads to Attachment Scanner for virus and malware prevention.☆17Feb 13, 2026Updated 2 months ago
- Uses GloVe embeddings and greedy sequence segmentation to semantically segment a text document into any number of k segments.☆33Feb 17, 2019Updated 7 years ago
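- A simple script for HTTP brute-force and credential-stuffing attacks☆14Sep 12, 2015Updated 10 years ago
- "谛听" (discern) asset identification and analysis platform, a simplified search engine for IoT device information security and an iteratively optimized version of IOT-Scanner. It currently integrates host discovery, port scanning, device identification, vulnerability matching, PoC verification, and other functions.☆17Feb 6, 2021Updated 5 years ago
- azazel decompiler; ftrace function tracing; elfdemon code injection; lpv, skeksi, saruman viruses; quenya process reconstruction☆20Aug 26, 2018Updated 7 years ago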