jdf / cue.language
A small Java library for simple text analysis - counting strings, identifying languages, and removing stop words.
☆156Updated 5 years ago
Alternatives and similar repositories for cue.language:
Users that are interested in cue.language are comparing it to the libraries listed below
- A small Java library for simple text analysis - counting strings, identifying languages, and removing stop words.☆59Updated 7 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.☆281Updated 6 years ago
- Tiny REST client for Java and JSON.☆271Updated 8 years ago
- SIREn - Semi-Structured Information Retrieval Engine☆107Updated 3 years ago
- A fast and easy to use decision tree learner in java☆232Updated 2 years ago
- Find the Git commits you're looking for☆118Updated 2 years ago
- A port of the arclabs 'readability' package to Java☆72Updated 12 years ago
- Bixo is an open source web mining toolkit that runs as a series of Cascading pipes on top of Hadoop. By building a customized Cascading p…☆142Updated 2 years ago
- [not maintained] Custom Twitter Search via ElasticSearch&Wicket☆60Updated 4 years ago
- Various utilities regarding Levenshtein transducers. (Java)☆57Updated 3 years ago
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format.☆128Updated last year
- WARC (Web Archive) Input and Output Formats for Hadoop☆35Updated 10 years ago
- The Sweble Wikitext Components module provides a parser for MediaWiki's wikitext and an engine trying to emulate the behavior of a MediaW…☆71Updated 11 months ago
- Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.☆158Updated 2 years ago
- SQLite JDBC Driver☆159Updated 15 years ago
- Set of JavaDoc doclets for modern Java annotations APIs☆26Updated 10 years ago
- faceted search engine☆55Updated 11 years ago
- SimpleJPA - Java Persistence API (JPA) implementation for Amazon SimpleDB☆52Updated 4 years ago
- Sitebricks: A fast platform for web development.☆248Updated 2 years ago
- Leaner version of jpropel, containing only LINQ, reified collections and utilities for arrays/strings/numerics/files/xml etc.☆125Updated 11 years ago
- Mirror of Apache MRUnit☆38Updated 6 years ago
- Gretty is simple framework for networking☆91Updated 13 years ago
- A lightweight platform monitoring tool for Java VMs☆154Updated 8 years ago
- A memory-resident geospatial index library for Java☆41Updated 4 years ago
- A new object-graph-wrapper for the Tinkerpop 3 graph stack.☆40Updated 4 years ago
- A Java library that manages component action/event bindings for MVC patterns☆113Updated last week
- Bulk loading for elastic search☆185Updated last year
- This project was the home of code used to develop a modern date and time library for JDK8. Development has moved to OpenJDK and a separat…☆191Updated 8 years ago
- Java implementation of a probabilistic set data structure☆143Updated 7 years ago
- Yoga is RESTful but flexible.☆157Updated last year