optimaize / language-detectorLinks
Language Detection Library for Java
☆584Updated 3 years ago
Alternatives and similar repositories for language-detector
Users that are interested in language-detector are comparing it to the libraries listed below
Sorting:
- This is a language detection library implemented in plain Java. (aliases: language identification, language guessing)☆760Updated 6 years ago
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format.☆131Updated last year
- Java implementation of the Aho-Corasick algorithm for efficient string matching☆963Updated 6 months ago
- A set of reusable Java components that implement functionality common to any web crawler☆248Updated last week
- Similarity or Distance Metrics, e.g. Levenshtein, for Java☆357Updated 4 years ago
- A java classifier based on the naive Bayes approach complete with Maven support and a runnable example.☆299Updated 5 years ago
- Word2Vec Java Port☆190Updated 7 years ago
- Java natural language date parser☆526Updated last year
- A language detection library for the JVM☆36Updated 2 years ago
- Readability clone in Java☆460Updated 5 years ago
- Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary.☆197Updated 2 years ago
- A Java library to detect and normalize URLs in text☆782Updated 4 months ago
- Various utilities regarding Levenshtein transducers. (Java)☆58Updated 3 years ago
- PATRICIA, Double Array, LOUDS Trie implementations for Java☆180Updated last year
- Java Perceptual Hash☆90Updated 8 years ago
- A Java implementation of Locality Sensitive Hashing (LSH)☆300Updated 3 years ago
- UADetector is a library to identify over 190 different desktop and mobile browsers and 130 other User-Agents like feed readers, email cli…☆248Updated 3 years ago
- Java version of LIBLINEAR☆307Updated 10 months ago
- Solr query parser plugin that performs proper query-time synonym expansion.☆149Updated 4 years ago
- Automatically exported from code.google.com/p/chromium-compact-language-detector☆161Updated 5 years ago
- Efficient training of Support Vector Machines in Java☆119Updated 5 years ago
- Concurrent Radix and Suffix Trees for Java☆515Updated 4 years ago
- MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, informat…☆1,018Updated 5 months ago
- Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statisti…☆1,087Updated 2 years ago
- A fast and comprehensive Java library capable of performing automaton and non-automaton based Levenshtein distance determination and neig…☆44Updated 12 years ago
- A Java library that implements several algorithms that calculate similarity between strings.☆160Updated 4 years ago
- An Elasticsearch ingest processor to do named entity extraction using Apache OpenNLP☆274Updated 3 years ago
- Pdf2Dom is a PDF parser that converts the documents to a HTML DOM representation. The obtained DOM tree may be then serialized to a HTM…☆191Updated 3 years ago
- A simple implementation of simhash algorithm by java.☆155Updated 5 years ago
- Java library to extract links (URLs, email addresses) from plain text; fast, small and smart☆212Updated 5 months ago