shuyo / language-detectionLinks
This is a language detection library implemented in plain Java. (aliases: language identification, language guessing)
☆761Updated 6 years ago
Alternatives and similar repositories for language-detection
Users that are interested in language-detection are comparing it to the libraries listed below
Sorting:
- Language Detection Library for Java☆586Updated 3 years ago
- Compact Language Detector 2☆890Updated 4 years ago
- Automatically exported from code.google.com/p/chromium-compact-language-detector☆161Updated 5 years ago
- Word2Vec Java Port☆192Updated 7 years ago
- MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, informat…☆1,022Updated this week
- Java interface for fastText☆245Updated 2 years ago
- TextTeaser is an automatic summarization algorithm.☆1,975Updated 8 years ago
- Apache OpenNLP☆1,578Updated last week
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format.☆131Updated last year
- This tool extracts word vectors from Lucene index.☆135Updated 8 years ago
- Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary.☆199Updated 2 weeks ago
- Language Detection with Infinity-gram☆230Updated 10 years ago
- Work in progress transmit from Google Code☆1,127Updated 8 years ago
- ☆870Updated 2 years ago
- DKPro JWPL (DKPro Java Wikipedia Library) is a free, Java-based application programming interface that facilitates access to all informat…☆88Updated this week
- ☆185Updated 7 years ago
- DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text.☆760Updated 7 years ago
- An Elasticsearch ingest processor to do named entity extraction using Apache OpenNLP☆276Updated 3 years ago
- Heuristic based boilerplate removal tool☆811Updated 11 months ago
- A python implementation of the Rapid Automatic Keyword Extraction☆983Updated 5 years ago
- Apache Joshua☆111Updated 5 years ago
- A set of reusable Java components that implement functionality common to any web crawler☆252Updated last week
- CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, rel…☆480Updated 2 years ago
- Java natural language date parser☆526Updated 2 years ago
- A bundle of html content extraction algorithms☆122Updated 10 years ago
- Simhash and near-duplicate detection☆423Updated 2 years ago
- Java version of LIBLINEAR☆308Updated last year
- Carrot2 plugin for ElasticSearch☆295Updated 3 years ago
- Java interface for CRFsuite: http://www.chokkan.org/software/crfsuite/☆44Updated 8 years ago
- Machine learning components for Apache UIMA☆132Updated 2 years ago