jdf / cue.language
A small Java library for simple text analysis - counting strings, identifying languages, and removing stop words.
☆155Updated 4 years ago
Related projects: ⓘ
- A small Java library for simple text analysis - counting strings, identifying languages, and removing stop words.☆58Updated 6 years ago
- Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.☆158Updated last year
- SIREn - Semi-Structured Information Retrieval Engine☆106Updated 3 years ago
- A port of the arclabs 'readability' package to Java☆72Updated 12 years ago
- [not maintained] Custom Twitter Search via ElasticSearch&Wicket☆61Updated 3 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.☆281Updated 6 years ago
- SQLite JDBC Driver☆159Updated 14 years ago
- A Java library that manages component action/event bindings for MVC patterns☆111Updated this week
- A fast and easy to use decision tree learner in java☆232Updated 2 years ago
- ☆21Updated this week
- The Sweble Wikitext Components module provides a parser for MediaWiki's wikitext and an engine trying to emulate the behavior of a MediaW…☆70Updated 5 months ago
- WARC (Web Archive) Input and Output Formats for Hadoop☆35Updated 9 years ago
- Various utilities regarding Levenshtein transducers. (Java)☆56Updated 2 years ago
- Leaner version of jpropel, containing only LINQ, reified collections and utilities for arrays/strings/numerics/files/xml etc.☆124Updated 11 years ago
- GWT implementation of standard the node.js library☆87Updated 12 years ago
- Java text categorization system☆54Updated 7 years ago
- Delicious on Android☆17Updated 11 years ago
- Practical Algorithm to Retrieve Information Coded in Alphanumeric (PATRICIA)☆176Updated 5 years ago
- Java framework for Google App Engine☆80Updated 4 years ago
- This project aims to provide a Java wrapper for Github API.☆64Updated 10 years ago
- Java port of Python NLTK Vader Sentiment Analyzer. VADER (Valence Aware Dictionary and sEntiment Reasoner) is a lexicon and rule-based se…☆61Updated last year
- Cross platform mobile development toolkit consisting of a DSL for defining mobile apps and code generators for creating native apps for i…☆96Updated 4 years ago
- Bixo is an open source web mining toolkit that runs as a series of Cascading pipes on top of Hadoop. By building a customized Cascading p…☆142Updated 2 years ago
- ☆43Updated this week
- Haml (XHTML Abstraction Markup Language) implementation in Java.☆82Updated 11 years ago
- Bulk loading for elastic search☆186Updated 9 months ago
- Java implementation of a probabilistic set data structure☆142Updated 7 years ago
- Provides support to increase developer productivity in Java when using a graph database like Neo4j. Uses familiar Spring concepts such a…☆64Updated 2 years ago
- Hunspell library for Java based on JNA☆62Updated last year
- A Sass compiling filter for Java web apps☆79Updated 7 years ago