wuman / JReadability
Java port of Arc90's Readability.js - parses HTML as input and returns clean, easy-to-read text
☆170Updated 11 years ago
Related projects ⓘ
Alternatives and complementary repositories for JReadability
- A bundle of html content extraction algorithms☆121Updated 9 years ago
- Readability clone in Java☆461Updated 4 years ago
- A port of the arclabs 'readability' package to Java☆72Updated 12 years ago
- HtmlExtractor是一个Java实现的基于模板的网页结构化信息精准抽取组件。☆157Updated 6 years ago
- 自动抽取网页正文的算法,用JAVA实现☆107Updated 7 years ago
- a tool which can let us log on the running application without restart.☆149Updated 8 years ago
- 结巴分词(java版)☆37Updated 10 years ago
- A lite distributed Java spider framework :-)☆148Updated 7 years ago
- A simple implementation of simhash algorithm by java.☆154Updated 4 years ago
- Yet another markdown processor for the JVM☆449Updated 4 years ago
- Memory consumption estimator for Java☆120Updated 6 years ago
- Java multithread download library☆269Updated 2 years ago
- A fast async http client based on netty☆42Updated 11 years ago
- ☆233Updated last month
- nutcher是中文的nutch文档,包含nutch的配置和源码解析,持续更新中。☆129Updated 5 years ago
- Lightweight dependency injection for Java and Android (JSR-330)☆356Updated 6 years ago
- Language Detection Library for Java☆569Updated 2 years ago
- More kryo serializers☆381Updated 2 months ago
- Mensa is a generic, flexible, enhanced, and efficient Java implementation of a pattern matching state machine as described by the 1975 pa…☆94Updated 9 years ago
- Java诊断工具☆74Updated 7 years ago
- Douban Java SDK OAuth2☆186Updated 7 years ago
- Scala SDK for http://www.douban.com☆40Updated 8 years ago
- Java parser for .proto schema declarations.☆211Updated 6 years ago
- A small and easy to use parser generator. Specify your grammar in pure java and compile dynamically. Especially suitable for DSL creation…☆92Updated 3 years ago
- Save 25% memory for you.☆136Updated 9 years ago
- A Java WebSocket/HTTP server based on the Atmosphere and Netty Framework☆321Updated 11 months ago
- 基于Apache Nutch和Htmlunit的扩展实现AJAX页面爬虫抓取解析插件☆123Updated 9 years ago
- Socket.IO Client Implementation in Java☆22Updated 6 years ago