carrotsearch / langid-javaLinks
Java port of langid.py (language identifier)
☆28Updated 12 years ago
Alternatives and similar repositories for langid-java
Users that are interested in langid-java are comparing it to the libraries listed below
Sorting:
- Language Detection Library for Java☆582Updated 3 years ago
- Word2Vec Java Port☆190Updated 7 years ago
- This is a language detection library implemented in plain Java. (aliases: language identification, language guessing)☆759Updated 6 years ago
- Java text categorization system☆57Updated 8 years ago
- Apache Joshua☆109Updated 5 years ago
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format.☆129Updated last year
- A large-scale statistical machine translation system written in Java.☆212Updated 3 years ago
- NLP framework for JVM languages.☆151Updated 4 years ago
- Apache OpenNLP Sandbox☆44Updated last week
- Java version of LIBLINEAR☆307Updated 9 months ago
- A fast and comprehensive Java library capable of performing automaton and non-automaton based Levenshtein distance determination and neig…☆43Updated 12 years ago
- A language detection library for the JVM☆36Updated 2 years ago
- MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, informat…☆1,011Updated 3 months ago
- A set of reusable Java components that implement functionality common to any web crawler☆246Updated 2 weeks ago
- KEA - Keyphrase Extraction Algorithm☆23Updated 9 years ago
- DKPro JWPL (DKPro Java Wikipedia Library) is a free, Java-based application programming interface that facilitates access to all informat…☆88Updated this week
- Similarity or Distance Metrics, e.g. Levenshtein, for Java☆358Updated 4 years ago
- Java interface for fastText☆244Updated 2 years ago
- Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.☆200Updated 2 months ago
- A Java library implementing practical nearest neighbour search algorithm for multidimensional vectors that operates in sublinear time. It…☆201Updated 5 years ago
- Machine learning components for Apache UIMA☆131Updated 2 years ago
- Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary.☆196Updated 2 years ago
- Various utilities regarding Levenshtein transducers. (Java)☆58Updated 3 years ago
- CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, rel…☆479Updated 2 years ago
- Pure Java implementation of Van Der Maaten and Hinton's t-sne clustering algorithm☆198Updated 2 years ago
- A convenience Java wrapper around GloVe word vectors and converter to more space efficient binary files.☆25Updated 4 years ago
- Facebook's FastText for Java☆81Updated 7 years ago
- small Java library for splitting German compound words☆63Updated last year
- Program used to split text into segments☆27Updated 11 months ago
- Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or fi…☆194Updated last week