gsh199449 / spider
A configurable web spider with a easy-to-use web console
☆993Updated 6 years ago
Alternatives and similar repositories for spider:
Users that are interested in spider are comparing it to the libraries listed below
- WebCollector is an open source web crawler framework based on Java.It provides some simple interfaces for crawling the Web,you can setup …☆3,077Updated last month
- A lightweight web crawler framework.(Java爬虫框架)☆713Updated last month
- Easy to use lightweight web crawler(易用的轻量化网络爬虫)☆2,510Updated last year
- 使用WebMagic抓取招聘信息,并且持久化到Mysql的例子。☆224Updated 8 years ago
- 一个简单、敏捷、分布式的支持SpringBoot的Java爬虫框架;An agile, distributed crawler framework.☆1,982Updated 3 months ago
- A headless,standalone webkit server which make grabing dynamic web page easier.☆225Updated 6 years ago
- (微信开发工具包)weixin sdk for Java☆817Updated 2 months ago
- Jcseg is a light weight NLP framework developed with Java. Provide CJK and English segmentation based on MMSEG algorithm, With also keywo…☆921Updated last year
- 极其方便的实现微信公众平台服务端开发,2行代码完成服务器绑定,3行代码实现用户消息监听☆774Updated 7 months ago
- 给爬虫使用的代理IP池☆552Updated 5 years ago
- The minimalist framework of RESTful(server and client) - Resty☆1,244Updated 3 years ago
- A Java CAPTCHA recognition library for sticky characters☆207Updated 10 years ago
- 🗯 wechat-api by java7.☆1,814Updated 6 years ago
- Java后端实现三方支付集成支付宝(国内、国际、移动端、PC端)、微信、银联(acp、upop)、光大(网关、网页)、邮政支付☆787Updated 7 years ago
- Java分布式中文分词组件 - word分词☆1,819Updated 3 years ago
- JavaEE项目开发脚手架(我的公众号:kaitao-1234567,我的新书:《亿级流量网站架构核心技术》)☆2,164Updated 6 years ago
- JDeploy自动化部署平台☆586Updated 2 years ago
- 新浪微博爬虫,采用Java语言开发,基于HTTPClient 4.0,采用MySQL存储爬取数据,支持多进程并发执行。功能包括:爬取微博、评论、转发、关注列表(层次)。根据数据需求,持续更新...☆352Updated 11 years ago
- 基于Spring Boot的注解驱动式公众号极速开发框架,用注解重新定义公众号开发☆652Updated 2 years ago
- a java blog☆246Updated 7 months ago
- beetl2.0☆415Updated 5 years ago
- 分布式任务调度平台(Distributed Job Schedule Platform)☆559Updated 2 years ago
- 通用权限管理系统:作为配置中心,管理后台系统的菜单、功能、用户、角色等,并提供DUBBO接口。☆512Updated 6 years ago
- Jsoup学习笔记。添加了部分学习代码和注释。☆637Updated last year
- APDPlat是Application Product Development Platform的缩写,即应用级产品开发平台。☆521Updated 2 years ago
- NetDiscovery 是一款基于 Vert.x、RxJava 2 等框架实现的通用爬虫框架/中间件。☆639Updated 4 years ago
- zhihu-crawler是一个基于Java的高性能、支持免费http代理池、支持横向扩展、分布式爬虫项目☆914Updated 5 years ago
- rapid open platform☆421Updated 7 years ago
- 对java、scala等运行于jvm的程序进行实时日志采集、索引和可视化,对系统进行进程级别的监控,对系统内部的操作进行策略性的报警、对分布式的rpc调用进行trace跟踪以便于进行性能分析☆864Updated 2 years ago
- 基于 webmagic 的 Java 爬虫应用☆2,781Updated 3 years ago