optimaize / language-detector
Language Detection Library for Java
☆575 · Updated 2 years ago
Alternatives and similar repositories for language-detector:
Users who are interested in language-detector are comparing it to the libraries listed below.
- This is a language detection library implemented in plain Java. (aliases: language identification, language guessing) ☆749 · Updated 6 years ago
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format. ☆128 · Updated last year
- A set of reusable Java components that implement functionality common to any web crawler ☆243 · Updated 3 weeks ago
- Java implementation of the Aho-Corasick algorithm for efficient string matching ☆923 · Updated 11 months ago
- Word2Vec Java Port ☆186 · Updated 6 years ago
- The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike ☆743 · Updated last month
- A generic Tf-Idf utility with example code that works on n-grams extracted from a text document. ☆23 · Updated 10 years ago
- Pdf2Dom is a PDF parser that converts the documents to an HTML DOM representation. The obtained DOM tree may then be serialized to a HTM… ☆182 · Updated 2 years ago
- A Java library implementing a practical nearest neighbour search algorithm for multidimensional vectors that operates in sublinear time. It… ☆202 · Updated 4 years ago
- Java Perceptual Hash ☆88 · Updated 7 years ago
- A Java library that implements several algorithms that calculate similarity between strings. ☆159 · Updated 4 years ago
- Readability clone in Java ☆459 · Updated 4 years ago
- Snappy compressor/decompressor for Java ☆1,057 · Updated last week
- A plugin for language detection in Elasticsearch using Nakatani Shuyo's language detector ☆252 · Updated 7 years ago
- A simple implementation of the simhash algorithm in Java. ☆155 · Updated 4 years ago
- Similarity or Distance Metrics, e.g. Levenshtein, for Java ☆346 · Updated 3 years ago
- This tool extracts word vectors from Lucene index. ☆134 · Updated 7 years ago
- A Java classifier based on the naive Bayes approach, complete with Maven support and a runnable example. ☆296 · Updated 4 years ago
- A language detection library for the JVM ☆36 · Updated last year
- Comparisons among all Java-based CSV parsers in existence ☆272 · Updated 5 years ago
- Various utilities regarding Levenshtein transducers. (Java) ☆57 · Updated 3 years ago
- DKPro JWPL (DKPro Java Wikipedia Library) is a free, Java-based application programming interface that facilitates access to all informat… ☆83 · Updated this week
- Hunspell library for Java based on JNA ☆62 · Updated 2 years ago
- galimatias is a URL parsing and normalization library written in Java. ☆162 · Updated last year
- Java port of langid.py (language identifier) ☆28 · Updated 11 years ago
- Java library to extract links (URLs, email addresses) from plain text; fast, small and smart ☆207 · Updated 5 months ago
- Repackaging of Boilerpipe published on Maven Central Repository. ☆53 · Updated last year
- UADetector is a library to identify over 190 different desktop and mobile browsers and 130 other User-Agents like feed readers, email cli… ☆247 · Updated 2 years ago
- This provides tools for the b-bit MinHash algorithm. ☆35 · Updated last week
- Pure Java implementation of van der Maaten and Hinton's t-SNE clustering algorithm ☆197 · Updated 2 years ago