carrotsearch / langid-java
Java port of langid.py (language identifier)
☆28Updated 11 years ago
Alternatives and similar repositories for langid-java:
Users that are interested in langid-java are comparing it to the libraries listed below
- Apache OpenNLP Sandbox☆43Updated this week
- Word2Vec Java Port☆186Updated 6 years ago
- Apache Joshua☆106Updated 4 years ago
- NLP framework for JVM languages.☆148Updated 3 years ago
- Java text categorization system☆55Updated 7 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts☆81Updated 6 years ago
- This tool extracts word vectors from Lucene index.☆134Updated 7 years ago
- A fast and comprehensive Java library capable of performing automaton and non-automaton based Levenshtein distance determination and neig…☆42Updated 11 years ago
- Browser-driven explorer for lucene indexes☆74Updated 3 years ago
- Java/JNI bindings to libpostal for for fast international street address parsing/normalization☆112Updated 10 months ago
- Querqy for Elasticsearch☆45Updated this week
- Machine learning components for Apache UIMA☆129Updated last year
- Java interface for fastText☆231Updated last year
- Software and resources for natural language processing.☆131Updated 8 years ago
- A text tagger based on Lucene / Solr, using FST technology☆176Updated last year
- Solr query parser plugin that performs proper query-time synonym expansion.☆150Updated 3 years ago
- Java implementation of the TextRank algorithm by Mihalcea, et al.☆75Updated 3 years ago
- Various utilities regarding Levenshtein transducers. (Java)☆57Updated 3 years ago
- A set of reusable Java components that implement functionality common to any web crawler☆243Updated 2 months ago
- ☆184Updated 6 years ago
- Automatically exported from code.google.com/p/berkeleylm☆98Updated 9 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json/avro dump☆253Updated last year
- NLP tools developed by Emory University.☆60Updated 8 years ago
- Approximate nearest neighbors in Java☆138Updated 4 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 8 years ago
- Auto tagging with OpenNPL☆16Updated 11 years ago
- Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm☆66Updated 4 years ago
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format.☆128Updated 11 months ago
- A large-scale statistical machine translation system written in Java.☆208Updated 3 years ago
- Stanford Pattern-based Information Extraction and Diagnostics -- Visualization☆93Updated 10 years ago