jdf / cue.language
A small Java library for simple text analysis - counting strings, identifying languages, and removing stop words.
☆156Updated 5 years ago
Alternatives and similar repositories for cue.language
Users who are interested in cue.language are comparing it to the libraries listed below.
- A small Java library for simple text analysis - counting strings, identifying languages, and removing stop words.☆59Updated 7 years ago
- SIREn - Semi-Structured Information Retrieval Engine☆107Updated 4 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.☆282Updated 7 years ago
- A port of the arclabs 'readability' package to Java☆72Updated 12 years ago
- [not maintained] Custom Twitter Search via ElasticSearch&Wicket☆60Updated 4 years ago
- Tiny REST client for Java and JSON.☆271Updated 8 years ago
- WARC (Web Archive) Input and Output Formats for Hadoop☆35Updated 10 years ago
- Find the Git commits you're looking for☆118Updated 2 years ago
- Readability clone in Java☆458Updated 4 years ago
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format.☆129Updated last year
- A Lazy Data Flow Framework (no longer active - see Apache TinkerPop)☆277Updated 3 years ago
- Java utilities for working with Schema.org data in JSON-LD format☆72Updated 2 years ago
- Java/JNI bindings to libpostal for fast international street address parsing/normalization☆120Updated last month
- The Sweble Wikitext Components module provides a parser for MediaWiki's wikitext and an engine trying to emulate the behavior of a MediaW…☆72Updated last year
- A fast and easy to use decision tree learner in java☆232Updated 3 years ago
- SQLite JDBC Driver☆159Updated 15 years ago
- Haml (XHTML Abstraction Markup Language) implementation in Java.☆82Updated 12 years ago
- Java implementation of a probabilistic set data structure☆144Updated 8 years ago
- galimatias is a URL parsing and normalization library written in Java.☆162Updated last year
- Bixo is an open source web mining toolkit that runs as a series of Cascading pipes on top of Hadoop. By building a customized Cascading p…☆142Updated 2 years ago
- Weave (Web-based Analysis and Visualization Environment)☆369Updated 6 years ago
- Html Content / Article Extractor in Scala - open sourced from Gravity Labs - http://gravity.com☆343Updated 5 years ago
- GWT implementation of the standard node.js library☆87Updated 13 years ago
- GContracts: Programming by Contract for Groovy☆112Updated 5 years ago
- Leaner version of jpropel, containing only LINQ, reified collections and utilities for arrays/strings/numerics/files/xml etc.☆125Updated 12 years ago
- Hidden Markov Models Java Library☆41Updated 8 years ago
- A lightweight Groovy toolkit for Google App Engine Java☆221Updated 6 years ago
- Sitebricks: A fast platform for web development.☆248Updated 2 years ago