wuman / JReadability
Java port of Arc90's Readability.js - parses HTML as input and returns clean, easy-to-read text
☆170Updated 11 years ago
Related projects ⓘ
Alternatives and complementary repositories for JReadability
- Readability clone in Java☆461Updated 4 years ago
- A bundle of html content extraction algorithms☆121Updated 9 years ago
- HtmlExtractor是一个Java实现的基于模板的网页结构化信息精准抽取组件。☆157Updated 6 years ago
- A port of the arclabs 'readability' package to Java☆72Updated 12 years ago
- a tool which can let us log on the running application without restart.☆149Updated 8 years ago
- Id generator based on Twitter's Snowflake☆103Updated 11 years ago
- A simple implementation of simhash algorithm by java.☆154Updated 4 years ago
- 结巴分词(java版)☆37Updated 10 years ago
- 自动抽取网页正文的算法,用JAVA实现☆107Updated 7 years ago
- A lite distributed Java spider framework :-)☆148Updated 7 years ago
- Apache Nutch Plugins for AJAX page fetch, parse, index☆86Updated 6 years ago
- This project is a fork of https://bitbucket.org/luciad/webp-imageio☆42Updated 9 years ago
- Scala SDK for http://www.douban.com☆40Updated 8 years ago
- Advanced state machines in Java.☆97Updated 5 years ago
- Lucene 中文分词“庖丁解牛” Paoding Analysis☆25Updated 13 years ago
- Memory consumption estimator for Java☆120Updated 6 years ago
- Threads benchmark code for Takipi blog☆71Updated 9 years ago
- SendCloud SDK For Java (sendcloud4j)☆36Updated 3 years ago
- nutcher是中文的nutch文档,包含nutch的配置和源码解析,持续更新中。☆129Updated 5 years ago
- Markdown parser and transformer implemented in Java☆151Updated 3 years ago
- 微信开放平台Java SDK☆51Updated 7 years ago
- A dynamic compilation tool for java which also allows you to cut language features.☆48Updated 9 years ago
- ☆187Updated last month
- Java port of Lokesh Dhakar Color Thief (grab color palette from an image)☆48Updated 6 years ago
- GraphicsMagick interactive mode Java integration☆64Updated 8 years ago
- 基于Apache Nutch和Htmlunit的扩展实现AJAX页面爬虫抓取解析插件☆123Updated 9 years ago
- A headless,standalone webkit server which make grabing dynamic web page easier.☆224Updated 5 years ago