wuman / JReadabilityLinks
Java port of Arc90's Readability.js - parses HTML as input and returns clean, easy-to-read text
☆172Updated 11 years ago
Alternatives and similar repositories for JReadability
Users that are interested in JReadability are comparing it to the libraries listed below
Sorting:
- Readability clone in Java☆459Updated 4 years ago
- A bundle of html content extraction algorithms☆122Updated 10 years ago
- HtmlExtractor是一个Java实现的基于模板的网页结构化信息精准抽取组件。☆155Updated 6 years ago
- A Java library to detect and normalize URLs in text☆782Updated 3 weeks ago
- Java multithread download library☆268Updated 2 years ago
- a tool which can let us log on the running application without restart.☆148Updated 8 years ago
- 结巴分词(java版)☆37Updated 10 years ago
- When jsoup meets XPath.☆468Updated 2 years ago
- A lite distributed Java spider framework :-)☆145Updated 8 years ago
- 自动抽取网页正文的算法,用JAVA实现☆106Updated 8 years ago
- Id generator based on Twitter's Snowflake☆105Updated 11 years ago
- Douban Java SDK OAuth2☆186Updated 8 years ago
- Java parser for .proto schema declarations.☆211Updated 6 years ago
- MultiThread Downloading Tool with Java Swing☆22Updated 10 years ago
- Yet another markdown processor for the JVM☆447Updated 4 years ago
- This project is a fork of https://bitbucket.org/luciad/webp-imageio☆42Updated 10 years ago
- Markdown parser and transformer implemented in Java☆151Updated 4 years ago
- Scala SDK for http://www.douban.com☆40Updated 8 years ago
- Resources for writing modern Java☆82Updated 9 years ago
- A dynamic compilation tool for java which also allows you to cut language features.☆48Updated 10 years ago
- nutcher是中文的nutch文档,包含nutch的配置和源码解析,持续更新中。☆130Updated 6 years ago
- MarkdownJ☆462Updated 4 years ago
- concurrency, collections, stats/analytics, config, testing, etc☆669Updated 4 years ago
- Superword is a Java open source project dedicated in the study of English words analysis and auxiliary reading.☆269Updated 2 years ago
- Apache Nutch Plugins for AJAX page fetch, parse, index☆88Updated 7 years ago
- A Java WebSocket/HTTP server based on the Atmosphere and Netty Framework☆324Updated last month
- Tiny HTTP router library for Netty, that can route and create reverse routes☆133Updated 7 years ago
- ☆22Updated 10 years ago
- 基于Apache Nutch和Htmlunit的扩展实现AJAX页面爬虫抓取解析插件☆124Updated 10 years ago
- A stand-alone Bloom filter implementation written in Java☆483Updated 13 years ago