jdf / cue.language
A small Java library for simple text analysis - counting strings, identifying languages, and removing stop words.
☆156Updated 5 years ago
Alternatives and similar repositories for cue.language
Users that are interested in cue.language are comparing it to the libraries listed below
Sorting:
- A small Java library for simple text analysis - counting strings, identifying languages, and removing stop words.☆59Updated 7 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.☆281Updated 7 years ago
- SIREn - Semi-Structured Information Retrieval Engine☆106Updated 3 years ago
- SQLite JDBC Driver☆159Updated 15 years ago
- [not maintained] Custom Twitter Search via ElasticSearch&Wicket☆59Updated 4 years ago
- A Java object representation of the Open Graph protocol for a web page☆160Updated last year
- Find the Git commits you're looking for☆118Updated 2 years ago
- WARC (Web Archive) Input and Output Formats for Hadoop☆35Updated 10 years ago
- The Sweble Wikitext Components module provides a parser for MediaWiki's wikitext and an engine trying to emulate the behavior of a MediaW…☆71Updated last year
- GWT implementation of standard the node.js library☆87Updated 13 years ago
- A Java library that manages component action/event bindings for MVC patterns☆113Updated last week
- A port of the arclabs 'readability' package to Java☆72Updated 12 years ago
- Java implementation of a probabilistic set data structure☆143Updated 7 years ago
- ☆161Updated 7 years ago
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format.☆128Updated last year
- Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.☆158Updated 2 years ago
- Leaner version of jpropel, containing only LINQ, reified collections and utilities for arrays/strings/numerics/files/xml etc.☆125Updated 11 years ago
- open-source word clouds for Processing☆201Updated 2 years ago
- Bixo is an open source web mining toolkit that runs as a series of Cascading pipes on top of Hadoop. By building a customized Cascading p…☆142Updated 2 years ago
- Cross platform mobile development toolkit consisting of a DSL for defining mobile apps and code generators for creating native apps for i…☆96Updated 5 years ago
- Java port of Python NLTK Vader Sentiment Analyzer. VADER (Valence Aware Dictionary and sEntiment Reasoner) is a lexicon and rule-based se…☆63Updated 2 years ago
- Java framework for Google App Engine☆80Updated 5 years ago
- Eclipse plugin for Apache Pig☆33Updated 11 years ago
- ☆36Updated 12 years ago
- SimpleJPA - Java Persistence API (JPA) implementation for Amazon SimpleDB☆52Updated 4 years ago
- Bulk loading for elastic search☆184Updated last year
- Simile Widgets Exhibit 3 code repository☆161Updated 11 years ago
- DocId set compression and set operation library☆27Updated 11 years ago
- A Java implementation of Twitter's text processing library☆363Updated 10 years ago
- MVEL (MVFLEX Expression Language)☆50Updated 13 years ago