jdf / cue.languageLinks
A small Java library for simple text analysis - counting strings, identifying languages, and removing stop words.
☆156Updated 5 years ago
Alternatives and similar repositories for cue.language
Users that are interested in cue.language are comparing it to the libraries listed below
Sorting:
- A small Java library for simple text analysis - counting strings, identifying languages, and removing stop words.☆58Updated 7 years ago
- SIREn - Semi-Structured Information Retrieval Engine☆108Updated 4 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.☆283Updated 7 years ago
- Tiny REST client for Java and JSON.☆272Updated 8 years ago
- Readability clone in Java☆460Updated 4 years ago
- [not maintained] Custom Twitter Search via ElasticSearch&Wicket☆60Updated 4 years ago
- Leaner version of jpropel, containing only LINQ, reified collections and utilities for arrays/strings/numerics/files/xml etc.☆125Updated 12 years ago
- A Sass compiling filter for Java web apps☆81Updated 8 years ago
- The Sweble Wikitext Components module provides a parser for MediaWiki's wikitext and an engine trying to emulate the behavior of a MediaW…☆73Updated last year
- This project was the home of code used to develop a modern date and time library for JDK8. Development has moved to OpenJDK and a separat…☆192Updated 9 years ago
- SQLite JDBC Driver☆159Updated 15 years ago
- Prefuse is a set of software tools for creating rich interactive data visualizations in the Java programming language. Prefuse supports a…☆572Updated last year
- ☆161Updated 7 years ago
- Bixo is an open source web mining toolkit that runs as a series of Cascading pipes on top of Hadoop. By building a customized Cascading p…☆142Updated 3 years ago
- A fast and easy to use decision tree learner in java☆234Updated 3 years ago
- Common Crawl support library to access 2008-2012 crawl archives (ARC files)☆500Updated 7 years ago
- A Lazy Data Flow Framework (no longer active - see Apache TinkerPop)☆279Updated 4 years ago
- Java/JNI bindings to libpostal for for fast international street address parsing/normalization☆127Updated last month
- A set of reusable Java components that implement functionality common to any web crawler☆246Updated last week
- An Object to Graph Framework (no longer active - see Apache TinkerPop)☆137Updated 4 years ago
- A Java library that manages component action/event bindings for MVC patterns☆116Updated last month
- A port of the arclabs 'readability' package to Java☆72Updated 12 years ago
- Haml (XHTML Abstraction Markup Language) implementation in Java.☆83Updated 12 years ago
- A koan-style tutorial in Java for Neo4j☆319Updated 10 years ago
- Find the Git commits you're looking for☆121Updated 2 years ago
- A lightweight platform monitoring tool for Java VMs☆155Updated 8 years ago
- GWT implementation of standard the node.js library☆87Updated 13 years ago
- Java implementation of a probabilistic set data structure☆144Updated 8 years ago
- Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.☆159Updated 2 years ago
- Automatic, zero-config web scraping -- written in Java, has no dependency on Java EE or app servers, and the web scraper has a restful/JS…☆155Updated 8 years ago