yida-lxw / spider4j
Spider4j is an open source web crawler for Java, extended from webmagic, which provides a simple interface for crawling the Web. Using it, you can set up a multi-threaded web crawler in a few minutes.
☆18 Updated 2 years ago
Alternatives and similar repositories for spider4j
Users that are interested in spider4j are comparing it to the libraries listed below
- Gecko crawler with distributed crawling support via Redis ☆24 Updated 7 years ago
- Search system based on Solr 4.9.0, including Solr index building, a Dubbo interface for Solr index queries, and more. ☆30 Updated 3 years ago
- DEPRECATED ☆27 Updated 7 years ago
- Gecco crawler downloader for HtmlUnit ☆27 Updated 9 years ago
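- Tracing, monitoring, and alerting platform for large-scale distributed systems ☆55 Updated 11 years ago
- Chromium-based headless browser for Java ☆28 Updated 9 years ago
- A URL-based permission system built on Shiro ☆43 Updated 11 years ago
- Cronner is a distributed scheduled-task framework that supports job dependencies, sharding, failover, centralized configuration, and monitoring. ☆54 Updated 7 years ago
- Rule engine system based on Drools ☆98 Updated 12 years ago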
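- ☆22 Updated 8 years ago
- A search solution based on ES ☆25 Updated 3 years ago
- Data exchange middleware based on ActiveMQ ☆14 Updated 11 years ago
- A simple, practical synchronization tool that periodically syncs data from a MySQL database to Elasticsearch. With only simple configuration it achieves remarkable results. Supports Elasticsearch 5.x. ☆48 Updated 9 years ago
- A distributed task scheduling component based on ZooKeeper plus Quartz/Spring Task. It is very compact and gives Spring Task distributed capabilities without any modification, ensuring that every task in the cluster runs without duplication or omission. ☆31 Updated 10 years ago
- Public-opinion search service framework, using Lucene and Solr version 4.8. ☆61 Updated 10 years ago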
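- An example for Spring MVC and Spring Data Elasticsearch ☆12 Updated 11 years ago
- Code accumulated while learning the JVM ☆32 Updated 11 years ago
- Tongbanjie's lightweight JDBC-layer database and table sharding framework ☆49 Updated 2 years ago
- Code memo containing sample code for MyBatis, Spring, Spring Boot, HBase, Hive, Guava, the JDK, and more ☆20 Updated 3 years ago
- A ZooKeeper client that implements reconnection after disconnects, reconnection after session expiry, persistent watches, and watches on child-node data changes. It also adds common features such as distributed locks, leader election, master-slave service locks, and distributed queues. ☆26 Updated 9 years ago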
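- Spring integration with Guava and Redis for local and remote caching ☆42 Updated 10 years ago
- Rule engine based on Drools ☆137 Updated 9 years ago
- Groovy-based lightweight rule engine ☆82 Updated 9 years ago
- Extensions for monitoring with Dianping CAT (v1.3.6), mainly cross-service message trees (over Dubbo and HTTP) plus cache and DB monitoring ☆68 Updated 9 years ago
- Unified configuration center implementation based on ZooKeeper ☆46 Updated 10 years ago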
- This is a sample program about Lucene 5 that I wrote while learning Lucene 5, now shared with everyone who loves Lucene. ☆19 Updated 10 years ago
- The site's lowest-level code; copy it and extend it for your own use ☆29 Updated 9 years ago
- Activiti essentials: a collection of valuable code and ideas. ☆117 Updated 2 years ago
- A Java web servlet filter for distributed session caching. ☆24 Updated 11 years ago
- This project mainly aims to let people familiar with SQL query Elasticsearch data easily, lowering the learning curve. ☆48 Updated 11 years ago