Netbreeze-GmbH / boilerpipeLinks
boilerpipe 1.2.2 - a fork from 1.2.0 with additional features
☆43Updated 8 years ago
Alternatives and similar repositories for boilerpipe
Users that are interested in boilerpipe are comparing it to the libraries listed below
Sorting:
- Repackaging of Boilerpipe published on Maven Central Repository.☆53Updated 2 years ago
- Readability clone in Java☆461Updated 5 years ago
- A java library for stored queries☆378Updated 2 years ago
- boilerpipe 1.2.2 - a fork from 1.2.0 with additional features☆10Updated 9 years ago
- Event capture and querying framework for Java☆410Updated 4 years ago
- ☆61Updated 5 years ago
- A port of the arclabs 'readability' package to Java☆72Updated 13 years ago
- Html Content / Article Extractor in Scala - open sourced from Gravity Labs - http://gravity.com☆343Updated 6 years ago
- The Sweble Wikitext Components module provides a parser for MediaWiki's wikitext and an engine trying to emulate the behavior of a MediaW…☆72Updated last year
- Language Detection Library for Java☆586Updated 3 years ago
- Integration of Samza and Luwak☆100Updated 11 years ago
- Various utilities regarding Levenshtein transducers. (Java)☆59Updated 4 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json/avro dump☆255Updated 2 years ago
- A modular toolkit for building web services with Guice, inspired by DropWizard☆113Updated 2 weeks ago
- CommonCrawl WARC/WET/WAT examples and processing code for Java + Hadoop☆37Updated last year
- Java Perceptual Hash☆89Updated 8 years ago
- Minimal example of getting Dropwizard going with Gradle (instead of Maven).☆64Updated 12 years ago
- TinkerPop 3 implementation on Elasticsearch backend☆70Updated 10 years ago
- A skeleton DropWizard Web Application integrating several useful open source projects☆116Updated 11 months ago
- A small library for composing asynchronous code☆287Updated 6 years ago
- A set of reusable Java components that implement functionality common to any web crawler☆252Updated last week
- distributed Actors for Java 8 / JavaScript☆344Updated 2 years ago
- Fast Geo Lookup that returns a ZIP, city, and state for a given latitude and longitude☆25Updated 11 years ago
- Fauxflake is an easily embeddable, decentralized, k-ordered unique ID generator.☆42Updated 9 years ago
- Juicer is a web API for extracting text, meta data and named entities from HTML "article" type pages.☆59Updated 10 years ago
- Lutung - A Java Mandrill API Connector☆178Updated 3 years ago
- Isomorphic (server-side) rendering of a simple react (comment box) component - Java 8's nashorn vs node.js microbenchmark☆23Updated 2 months ago
- Pure Java implementation of Van Der Maaten and Hinton's t-sne clustering algorithm☆199Updated 2 years ago
- Official AMD Aparapi repository☆341Updated 9 years ago
- Hierarchical Temporal Memory implementation in Java - an official Community-Driven Java port of the Numenta Platform for Intelligent Comp…☆315Updated 4 years ago