A port of the arclabs 'readability' package to Java
☆72Sep 10, 2012Updated 13 years ago
Alternatives and similar repositories for Java-readability
Users that are interested in Java-readability are comparing it to the libraries listed below
Sorting:
- A bundle of html content extraction algorithms☆122Mar 27, 2015Updated 10 years ago
- Java clone for python term extractor topia.termextract☆34Aug 22, 2014Updated 11 years ago
- Readability clone in Java☆461Oct 13, 2020Updated 5 years ago
- The Kyoyo Language Modeling Toolkit☆27Nov 27, 2014Updated 11 years ago
- Dialog for Android TextView to improve readability☆14Sep 30, 2015Updated 10 years ago
- Clojure bindings to Apache Tika project☆24Jul 4, 2013Updated 12 years ago
- Samples for jetbrick-template-2x☆11Mar 17, 2017Updated 9 years ago
- 分布式网络爬虫架构☆16Sep 26, 2016Updated 9 years ago
- 抓取各报社报纸信息-采用配置文件形式实现的一个简单的可定制爬虫☆11Sep 1, 2022Updated 3 years ago
- Web/FileSystem Crawler Library☆36Updated this week
- An Android HTTP Library with OkHttp.☆15May 11, 2017Updated 8 years ago
- Please note that this legacy AlchemyAPI SDK is no longer supported by IBM. Please use the Watson SDKs https://github.com/watson-developer…☆14Sep 28, 2016Updated 9 years ago
- 网络爬虫☆51Mar 18, 2014Updated 12 years ago
- 微博 收藏抓取,瀑布流+卡片式展示☆20Feb 5, 2015Updated 11 years ago
- jQuery waterfall Plugin☆66Apr 3, 2018Updated 7 years ago
- Load Tensorflow pb file using Bert/TextCNNs, an ensemble model using Java.☆11Aug 20, 2021Updated 4 years ago
- Clojure library for sitemap generation.☆16Feb 25, 2019Updated 7 years ago
- 基于人工神经网络的中文语义相似度计算研究☆11Apr 1, 2013Updated 12 years ago
- Html Content / Article Extractor in Scala - open sourced from Gravity Labs☆1,530Apr 18, 2017Updated 8 years ago
- ⚠️UNMAINTAINED☆11Mar 27, 2020Updated 5 years ago
- A robots.txt parser written in Clojure.☆16Dec 15, 2011Updated 14 years ago
- JS module for making short summary of some text☆13Nov 3, 2014Updated 11 years ago
- Leiningen plugin to compile haml files☆34Dec 30, 2013Updated 12 years ago
- location-based resource tracking☆10Jan 30, 2022Updated 4 years ago
- Spider_SinaTweetCrawler, to crawl tweet content from sinaTweet. (java)☆23Apr 5, 2017Updated 8 years ago
- DistributeCrawler的Maven版☆10Jun 20, 2022Updated 3 years ago
- Python library for managing stop words in many languages.☆12May 11, 2015Updated 10 years ago
- A library for creating n-grams, skip-grams, bag of words, bag of n-grams, bag of skip-grams.☆14Mar 8, 2022Updated 4 years ago
- CRFs based Chinese word segmentor☆21Oct 8, 2014Updated 11 years ago
- Structured Data Extractor. An application to extract structured data from web pages. It uses Data Extraction Based on Partial Tree Alignm…☆49Jun 9, 2012Updated 13 years ago
- Autoproxy automatically detects proxies and stores them in the respective environment variables (e.g. http_proxy).☆13Oct 2, 2016Updated 9 years ago
- Data Structure Serial - Graph☆11Sep 19, 2019Updated 6 years ago
- Work in progress transmit from Google Code☆1,127Jan 3, 2018Updated 8 years ago
- A library to make use of Java NLP libraries like Stanford's CoreNLP and CMU's ark-tweet-nlp in Clojure☆37Jun 5, 2015Updated 10 years ago
- Package implements a number local outlier factor algorithms for outlier detection and finding anomalous data☆12Jun 7, 2017Updated 8 years ago
- A Clojurescript interface to the Web Audio API, intended for sonification☆12Jul 22, 2020Updated 5 years ago
- HearSight智能音视频内容分析工具,支持多源视频(包括 URL和上传文件方式)导入能够从输入的视频源中提取上下文信息,从而提供更精准的 AI问答交互。平台基于视频语义单元 进行智能切片,用户可通过问答方式灵活调整切片维度,快速定位所需内容同时,HearSight支持自动…☆34Dec 12, 2025Updated 3 months ago
- "쓰면서 배우는 OAuth 2.0 & OpenID Connect" 자료 저장소☆24Jan 24, 2024Updated 2 years ago
- A full and clean frontend web workflow Yeoman generator. Simple configuration, powerful preprocessing and image pipeline, livereload and …☆40Apr 5, 2020Updated 5 years ago