pvdlg / boilerpipeLinks
Repackaging of Boilerpipe published on Maven Central Repository.
☆53Updated last year
Alternatives and similar repositories for boilerpipe
Users that are interested in boilerpipe are comparing it to the libraries listed below
Sorting:
- Readability clone in Java☆460Updated 5 years ago
- Html Content / Article Extractor in Scala - open sourced from Gravity Labs - http://gravity.com☆343Updated 6 years ago
- boilerpipe 1.2.2 - a fork from 1.2.0 with additional features☆43Updated 8 years ago
- Neo4j JDBC driver☆69Updated last year
- A language detection library for the JVM☆36Updated 2 years ago
- (DEPRECATED) -- moved under: https://github.com/FasterXML/jackson-dataformats-text☆194Updated 7 years ago
- Some example code of using Akka from Java☆121Updated 10 years ago
- Java port of Facebook's PlanOut A/B testing system with additional functionality☆120Updated 2 years ago
- A new object-graph-wrapper for the Tinkerpop 3 graph stack.☆40Updated 4 years ago
- Elasticsearch Index Termlist☆118Updated 6 years ago
- A set of reusable Java components that implement functionality common to any web crawler☆250Updated this week
- This is example of Jersey's Observable (RxJava) client extension using Netflix Hystrix latency and fault tolerant library.☆28Updated 9 years ago
- The Common Crawl Crawler Engine and Related MapReduce code (2008-2012)☆222Updated 2 years ago
- Wikidata processing with Akka streams Proof of Concept☆52Updated 10 years ago
- Distributed Realtime Search with Lucene and MongoDB☆60Updated 7 years ago
- Addon bundle for Dropwizard to support Java 8 features☆60Updated 9 years ago
- Java Perceptual Hash☆89Updated 8 years ago
- Solr / SolrCloud running in high performance server - tiny, fast startup, simple to configure, easy deployment without an application ser…☆96Updated 5 years ago
- (DEPRECATED) -- moved under `jackson-dataformats-binary☆39Updated 8 years ago
- command line tool for Apache Lucene☆163Updated 5 months ago
- Kibana-friendly Transport and HTTP Elasticsearch reporters for Dropwizard Metrics☆14Updated 10 years ago
- Integration of Samza and Luwak☆100Updated 11 years ago
- Chalk is a natural language processing library.☆260Updated 8 years ago
- Fureteur is a simple, configurable, fault-tolerant web crawler written is Scala☆28Updated 11 years ago
- (DEPRECATED) -- moved as a sub-project of `jackson-dataformats-text`☆138Updated 7 years ago
- A web-latency SQL spout for Hadoop.☆50Updated 4 years ago
- Multidimensional data storage with rollups for numerical data☆267Updated last month
- Java client for the HipChat v2 API☆39Updated 8 years ago
- UADetector is a library to identify over 190 different desktop and mobile browsers and 130 other User-Agents like feed readers, email cli…☆248Updated 3 years ago
- File-backed append-only object store.☆117Updated 9 years ago