carrotsearch / langid-java
Java port of langid.py (language identifier)
☆28 Updated 12 years ago
Alternatives and similar repositories for langid-java
Users who are interested in langid-java are comparing it to the libraries listed below.
- Language Detection Library for Java ☆585 Updated 3 years ago
- This is a language detection library implemented in plain Java. (aliases: language identification, language guessing) ☆760 Updated 6 years ago
- Word2Vec Java Port ☆191 Updated 7 years ago
- A fast and comprehensive Java library capable of performing automaton and non-automaton based Levenshtein distance determination and neig… ☆45 Updated 12 years ago
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format. ☆131 Updated last year
- Java version of LIBLINEAR ☆308 Updated 11 months ago
- A language detection library for the JVM ☆36 Updated 2 years ago
- Apache Joshua ☆110 Updated 5 years ago
- A set of reusable Java components that implement functionality common to any web crawler ☆251 Updated last week
- A large-scale statistical machine translation system written in Java. ☆212 Updated 4 years ago
- Java text categorization system ☆57 Updated 8 years ago
- Java/JNI bindings to libpostal for fast international street address parsing/normalization ☆131 Updated 5 months ago
- Various utilities regarding Levenshtein transducers. (Java) ☆58 Updated 4 years ago
- NLP framework for JVM languages. ☆152 Updated 4 years ago
- Pdf2Dom is a PDF parser that converts the documents to a HTML DOM representation. The obtained DOM tree may be then serialized to a HTM… ☆191 Updated last week
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts ☆81 Updated 7 years ago
- Java interface for fastText ☆244 Updated 2 years ago
- Hunspell library for Java based on JNA ☆63 Updated 2 years ago
- Java autocomplete library. ☆120 Updated 5 years ago
- Browser-driven explorer for lucene indexes ☆74 Updated 4 years ago
- Similarity or Distance Metrics, e.g. Levenshtein, for Java ☆357 Updated 4 years ago
- Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary. ☆197 Updated 2 years ago
- The Sweble Wikitext Components module provides a parser for MediaWiki's wikitext and an engine trying to emulate the behavior of a MediaW… ☆72 Updated last year
- Machine learning components for Apache UIMA ☆132 Updated 2 years ago
- Java natural language date parser ☆526 Updated 2 years ago
- Apache OpenNLP Sandbox ☆45 Updated this week
- A Java library implementing practical nearest neighbour search algorithm for multidimensional vectors that operates in sublinear time. It… ☆202 Updated 5 years ago
- A library to read PST files with java, without need for external libraries. ☆264 Updated 3 years ago
- High-performance pattern matching algorithms in Java ☆82 Updated 5 years ago
- MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, informat… ☆1,019 Updated last week