robbypond / boilerpipe
boilerpipe 1.2.2 - a fork from 1.2.0 with additional features
☆10Updated 8 years ago
Alternatives and similar repositories for boilerpipe:
Users that are interested in boilerpipe are comparing it to the libraries listed below
- Repackaging of Boilerpipe published on Maven Central Repository.☆53Updated last year
- Readability clone in Java☆460Updated 4 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.☆281Updated 6 years ago
- How to spot first stories on Twitter using Storm.☆125Updated last year
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format.☆127Updated 10 months ago
- Juicer is a web API for extracting text, meta data and named entities from HTML "article" type pages.☆60Updated 9 years ago
- Automatic, zero-config web scraping -- written in Java, has no dependency on Java EE or app servers, and the web scraper has a restful/JS…☆155Updated 7 years ago
- App Engine Java Managed VMs example: multi-stage tutorial based on adding features to the 'guestbook'.☆37Updated 8 years ago
- A Java object representation of the Open Graph protocol for a web page☆159Updated 10 months ago
- A small library for composing asynchronous code☆285Updated 5 years ago
- Foursquare V2 API for Java☆30Updated 7 years ago
- Comprehensive and FULL Java client for the Google Places API☆171Updated 3 years ago
- A port of the arclabs 'readability' package to Java☆72Updated 12 years ago
- Solr query parser plugin that performs proper query-time synonym expansion.☆150Updated 3 years ago
- boilerpipe 1.2.2 - a fork from 1.2.0 with additional features☆43Updated 7 years ago
- ☆65Updated 8 years ago
- Java text categorization system☆55Updated 7 years ago
- Models for POS tagging and sentence and tokens detection with OpenNLP tools for italian language☆52Updated 11 years ago
- Elasticsearch entity resolution plugin based on Duke☆210Updated 4 years ago
- A small program to detect gibberish using a Markov Chain☆27Updated 5 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 8 years ago
- A set of reusable Java components that implement functionality common to any web crawler☆240Updated last month
- Official Java client for the Keen IO API. Build analytics features directly into your Java apps.☆74Updated last year
- SKOS analysis for Elasticsearch☆54Updated 8 years ago
- ☆13Updated 9 years ago
- A maven plugin for packing Dropwizard applications as Debian packages.☆52Updated 8 years ago
- Java Library for authentication, getting profile, contacts and updating status on Google, Yahoo, Facebook, Twitter, LinkedIn, and many mo…☆250Updated last year
- Socialize SDK for Android. An Android social sharing SDK for native apps.☆146Updated 8 years ago
- Aho-Corasick algorithm as implemented in Java by Danny Yoo, with little improvements☆26Updated 10 years ago
- A high-performance sharded counter implementation for Google Appengine.☆29Updated 5 years ago