carrotsearch / langid-javaLinks
Java port of langid.py (language identifier)
☆28Updated 12 years ago
Alternatives and similar repositories for langid-java
Users that are interested in langid-java are comparing it to the libraries listed below
Sorting:
- Language Detection Library for Java☆582Updated 3 years ago
- This is a language detection library implemented in plain Java. (aliases: language identification, language guessing)☆758Updated 6 years ago
- Word2Vec Java Port☆190Updated 7 years ago
- Various utilities regarding Levenshtein transducers. (Java)☆58Updated 3 years ago
- NLP framework for JVM languages.☆151Updated 4 years ago
- A fast and comprehensive Java library capable of performing automaton and non-automaton based Levenshtein distance determination and neig…☆43Updated 12 years ago
- Java version of LIBLINEAR☆307Updated 8 months ago
- Similarity or Distance Metrics, e.g. Levenshtein, for Java☆356Updated 4 years ago
- Java interface for fastText☆244Updated 2 years ago
- Pdf2Dom is a PDF parser that converts the documents to a HTML DOM representation. The obtained DOM tree may be then serialized to a HTM…☆189Updated 2 years ago
- Java interface for CRFsuite: http://www.chokkan.org/software/crfsuite/☆44Updated 8 years ago
- A large-scale statistical machine translation system written in Java.☆212Updated 3 years ago
- A set of reusable Java components that implement functionality common to any web crawler☆247Updated this week
- Java text categorization system☆57Updated 8 years ago
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format.☆129Updated last year
- Java/JNI bindings to libpostal for for fast international street address parsing/normalization☆126Updated 2 months ago
- A generic Java toolkit for building dialogue systems☆198Updated last year
- Apache Joshua☆109Updated 5 years ago
- TextTeaser is an automatic summarization algorithm.☆1,978Updated 7 years ago
- CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, rel…☆478Updated 2 years ago
- Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.☆200Updated 2 months ago
- Java natural language date parser☆526Updated last year
- Machine learning components for Apache UIMA☆131Updated 2 years ago
- Facebook's FastText for Java☆81Updated 7 years ago
- A Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)☆176Updated 2 years ago
- Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or fi…☆194Updated last week
- A port of the arclabs 'readability' package to Java☆73Updated 13 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts☆81Updated 7 years ago
- Adds line-breaking, page-breaking, tables, and styles to PDFBox☆47Updated 2 years ago
- Easy-to-use Java library for similarity checking of strings or numeric-series☆20Updated 5 years ago