optimaize / language-detector
Language Detection Library for Java
☆577Updated 2 years ago
Alternatives and similar repositories for language-detector
Users that are interested in language-detector are comparing it to the libraries listed below
Sorting:
- This is a language detection library implemented in plain Java. (aliases: language identification, language guessing)☆751Updated 6 years ago
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format.☆128Updated last year
- A set of reusable Java components that implement functionality common to any web crawler☆244Updated 3 weeks ago
- Word2Vec Java Port☆186Updated 6 years ago
- Java Perceptual Hash☆88Updated 7 years ago
- Carrot2 plugin for ElasticSearch☆291Updated 2 years ago
- A bundle of html content extraction algorithms☆122Updated 10 years ago
- A simple implementation of simhash algorithm by java.☆155Updated 4 years ago
- Readability clone in Java☆459Updated 4 years ago
- The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike☆751Updated last month
- A scalable, mature and versatile web crawler based on Apache Storm☆907Updated this week
- Automatically exported from code.google.com/p/chromium-compact-language-detector☆162Updated 4 years ago
- Java interface for fastText☆235Updated last year
- Apache OpenNLP☆1,509Updated this week
- Similarity or Distance Metrics, e.g. Levenshtein, for Java☆350Updated 3 years ago
- Hunspell library for Java based on JNA☆62Updated 2 years ago
- Java version of LIBLINEAR☆305Updated 4 months ago
- When jsoup meets XPath.☆468Updated last year
- A Java implementation of Locality Sensitive Hashing (LSH)☆297Updated 2 years ago
- A language detection library for the JVM☆36Updated last year
- Repackaging of Boilerpipe published on Maven Central Repository.☆53Updated last year
- Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary.☆192Updated last year
- Java port of Arc90's Readability.js - parses HTML as input and returns clean, easy-to-read text☆170Updated 11 years ago
- A plugin for language detection in Elasticsearch using Nakatani Shuyo's language detector☆252Updated 7 years ago
- Java/JNI bindings to libpostal for for fast international street address parsing/normalization☆117Updated last week
- Efficient training of Support Vector Machines in Java☆117Updated 5 years ago
- Java natural language date parser☆523Updated last year
- Java implementation of the TextRank algorithm by Mihalcea, et al.☆75Updated 4 years ago
- Pure Java implementation of Van Der Maaten and Hinton's t-sne clustering algorithm☆197Updated 2 years ago
- Java library to extract links (URLs, email addresses) from plain text; fast, small and smart☆209Updated 5 months ago