wuman / JReadability
Java port of Arc90's Readability.js - parses HTML as input and returns clean, easy-to-read text
☆171Updated 11 years ago
Alternatives and similar repositories for JReadability:
Users that are interested in JReadability are comparing it to the libraries listed below
- Readability clone in Java☆459Updated 4 years ago
- A bundle of html content extraction algorithms☆121Updated 10 years ago
- A port of the arclabs 'readability' package to Java☆72Updated 12 years ago
- HtmlExtractor是一个Java实现的基于模板的网页结构化信息精准抽取组件。☆156Updated 6 years ago
- a tool which can let us log on the running application without restart.☆149Updated 8 years ago
- 自动抽取网页正文的算法,用JAVA实现☆106Updated 7 years ago
- Id generator based on Twitter's Snowflake☆104Updated 11 years ago
- Apache Nutch Plugins for AJAX page fetch, parse, index☆87Updated 6 years ago
- A simple implementation of simhash algorithm by java.☆155Updated 4 years ago
- 结巴分词(java版)☆37Updated 10 years ago
- A small and easy to use parser generator. Specify your grammar in pure java and compile dynamically. Especially suitable for DSL creation…☆92Updated 4 years ago
- Markdown parser and transformer implemented in Java☆151Updated 4 years ago
- A lite distributed Java spider framework :-)☆145Updated 7 years ago
- Tiny HTTP router library for Netty, that can route and create reverse routes☆132Updated 7 years ago
- 基于Apache Nutch和Htmlunit的扩展实现AJAX页面爬虫抓取解析插件☆123Updated 9 years ago
- Mensa is a generic, flexible, enhanced, and efficient Java implementation of a pattern matching state machine as described by the 1975 pa…☆94Updated 9 years ago
- Scala SDK for http://www.douban.com☆40Updated 8 years ago
- SendCloud SDK For Java (sendcloud4j)☆36Updated 3 years ago
- This project is a fork of https://bitbucket.org/luciad/webp-imageio☆42Updated 10 years ago
- Java诊断工具☆74Updated 7 years ago
- Java parser for .proto schema declarations.☆211Updated 6 years ago
- nutcher是中文的nutch文档,包含nutch的配置和源码解析,持续更新中。☆130Updated 5 years ago
- Language Detection Library for Java☆575Updated 2 years ago
- Threads benchmark code for Takipi blog☆71Updated 10 years ago
- Douban Java SDK OAuth2☆186Updated 8 years ago
- When jsoup meets XPath.☆469Updated last year
- Advanced state machines in Java.☆97Updated 6 years ago
- MultiThread Downloading Tool with Java Swing☆23Updated 10 years ago
- A protobuf based high performance rpc framework leveraging full-duplexing and asynchronous io with netty☆166Updated 9 years ago
- ☆190Updated 6 months ago