CrawlScript / DataHrefLinks
数据挖掘算法及工具教程
☆27Updated 9 years ago
Alternatives and similar repositories for DataHref
Users that are interested in DataHref are comparing it to the libraries listed below
Sorting:
- Apache hadoop management system☆314Updated 10 years ago
- A lite distributed Java spider framework :-)☆146Updated 8 years ago
- tns provides distributed solutions for thrift, support service discovery, high availability, load balancing, the gray release, horizontal…☆48Updated 8 years ago
- HtmlExtractor是一个Java实现的基于模板的网页结构化信息精准抽取组件。☆156Updated 7 years ago
- distributed cache based on redis ,support sharding,HA☆77Updated 12 years ago
- ☆106Updated 10 years ago
- Apache Nutch Plugins for AJAX page fetch, parse, index☆88Updated 7 years ago
- 让天下没有难开发的即时通讯☆64Updated 8 years ago
- 海狗-多维在线分析系统☆72Updated 11 years ago
- nutcher是中文的nutch文档,包含nutch的配置和源码解析,持续更新中。☆130Updated 6 years ago
- Java MVC framework, agile, fast, rich domain model, made especially for server side of mobile application (一个敏捷,快速,富领域模型的Java MVC 框架,专为 移…☆547Updated 2 years ago
- Java technology route☆65Updated 9 years ago
- 基于hadoop思维的分布式网络爬虫。☆85Updated 9 years ago
- A free-style benchmarking tool that can test anything callable by Java. And it produces apache-ab-like results☆57Updated 7 years ago
- ☆50Updated 11 years ago
- A distributed real-time stock picking system base on flume,kafka,jstorm,esper,and mysql☆161Updated 9 years ago
- 已作为 Hasor 的子项目,迁移到:http://git.oschina.net/zycgit/hasor☆79Updated 8 years ago
- Navi is a distributed service framework that provides cluster management and high performance RPC☆94Updated 9 years ago
- (用于归档)各种开源项目代码学习研究(包括代码注释、文档、用于代码分析的测试用例)☆216Updated 8 years ago
- Jack is a cluster manager built on top of Zookeeper and thrift.☆50Updated 2 years ago
- Uncode-DAL 是 Java 通用数据访问组件,基于mybatis、spring jdbc、hibernate等ORM框架开发,同时支持基于多数据源的读写分离、主备切换、故障转移,自动恢复、负载均衡、缓存等。可以大大提高开发速度。☆134Updated 9 years ago
- a simple netty RPC framework。 use protostuff-1.07 for serializer,use netty-3.2.1 for nio.☆41Updated 11 years ago
- 模仿Java标准库的一些API实现的算法库,包括了数据结构,字符串处理(StringBuilder),图(有向图)。原来是用Python实现的,但是Python实现的并没有经过完整的测试,不能够保证完全的正确性。 使用Java实现的集合库都经过完整的测试,实际上,我在实现的…☆49Updated 10 years ago
- common toolkit☆18Updated 9 years ago
- A Dapper based Large-Scale Distributed Systems Tracing Infrastructure☆98Updated 9 years ago
- 数据虫巢(微信号blogchong)公众号技术文章合集。虫巢出品,不说优品,最起码也得算个良品呐~~☆25Updated 8 years ago
- Symphony 的企业版,实现企业内网论坛。☆121Updated 8 years ago
- 高性能轻便的序列化组件☆34Updated 12 years ago
- java 分布式数据库访问框架,可以结合任何使用PreparedStatement操作的框架。在java jdbc api层实现 分表分库 路由解析的 框架 可以单独或者与用hibernate ibatis spring-jdbc 等框架结合使用,屏蔽api层使用差异,能实…☆83Updated 3 years ago
- This is a light distributed real time computing framework. It can help you quickly setup a your-self defined distributed real time comput…☆91Updated 9 years ago