vcl-xx / cue.language
A small Java library for simple text analysis - counting strings, identifying languages, and removing stop words.
☆59Updated 7 years ago
Alternatives and similar repositories for cue.language:
Users that are interested in cue.language are comparing it to the libraries listed below
- A small Java library for simple text analysis - counting strings, identifying languages, and removing stop words.☆156Updated 5 years ago
- Bixo is an open source web mining toolkit that runs as a series of Cascading pipes on top of Hadoop. By building a customized Cascading p…☆142Updated 2 years ago
- Java Binding for the OpenAL API☆15Updated last year
- Set of JavaDoc doclets for modern Java annotations APIs☆26Updated 10 years ago
- Pure Java implementation of the liblzo2 LZO compression algorithm☆47Updated 13 years ago
- Java implementation of a probabilistic set data structure☆143Updated 7 years ago
- DocId set compression and set operation library☆27Updated 10 years ago
- Eclipse plugin for Apache Pig☆33Updated 11 years ago
- AsyncHttpClient transport support for Jersey☆19Updated 10 years ago
- A fast and easy to use decision tree learner in java☆232Updated 2 years ago
- Library for creating In-memory circular buffers that use direct ByteBuffers to minimize GC overhead☆136Updated 2 years ago
- A runtime controller for OSGi based on a REST protocol☆15Updated 13 years ago
- Distributed Java Collections for ZooKeeper☆109Updated 8 years ago
- JNI Glue Code Generator☆91Updated 7 months ago
- Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.☆158Updated 2 years ago
- A variable length record, checksumming, append only rotating log implementation with graceful recovery☆54Updated 4 years ago
- Basic stand-alone disk-based N-way merge sort component for Java☆85Updated 2 weeks ago
- ☆29Updated this week
- Java text categorization system☆55Updated 7 years ago
- A distributed task queue worker designed for throughput, parallelism, and clustering.☆236Updated last year
- A library that adds some NLP capabilities to the Lucene search engine☆50Updated 11 years ago
- DirectMemory is a cache implementation featuring off-heap memory storage (a-la BigMemory) to enable caching of large (or large numbers of…☆151Updated 13 years ago
- A flexible pure-Java OCR implementation. Eventually.☆20Updated 10 years ago
- This project was the home of code used to develop a modern date and time library for JDK8. Development has moved to OpenJDK and a separat…☆191Updated 8 years ago
- Multi Framework OSGi Runner☆47Updated 4 years ago
- [NOT MAINTAINED ANYMORE] LZMA library for Java☆73Updated 6 years ago
- Continuous Streaming SQL Queries for Flume☆95Updated 13 years ago
- OSGified Scala libraries☆20Updated 13 years ago
- An OO/Functional Crit-bit tree in Java.☆27Updated 10 years ago
- Examples of use of pig scripting languages capabilities☆39Updated 8 years ago