shuyo / language-detection
This is a language detection library implemented in plain Java. (aliases: language identification, language guessing)
☆748Updated 6 years ago
Alternatives and similar repositories for language-detection:
Users that are interested in language-detection are comparing it to the libraries listed below
- Language Detection Library for Java☆575Updated 2 years ago
- Automatically exported from code.google.com/p/chromium-compact-language-detector☆161Updated 4 years ago
- Port of Google's language-detection library to Python.☆1,772Updated 3 weeks ago
- Compact Language Detector 2☆855Updated 3 years ago
- MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, informat…☆996Updated last year
- Language Detection with Infinity-gram☆231Updated 9 years ago
- ☆811Updated last year
- Work in progress transmit from Google Code☆1,114Updated 7 years ago
- Apache OpenNLP☆1,496Updated this week
- This tool extracts word vectors from Lucene index.☆135Updated 7 years ago
- Similarity or Distance Metrics, e.g. Levenshtein, for Java☆345Updated 3 years ago
- A set of reusable Java components that implement functionality common to any web crawler☆243Updated last week
- Stand-alone language identification system☆2,367Updated 5 years ago
- ☆184Updated 6 years ago
- CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, rel…☆475Updated last year
- Java interface for fastText☆232Updated last year
- Fast Entity Linker Toolkit for training models to link entities to KnowledgeBase (Wikipedia) in documents and queries.☆338Updated 4 years ago
- Generating Vectors for DBpedia Entities via Word2Vec and Wikipedia Dumps. Questions? https://gitter.im/idio-opensource/Lobby☆600Updated 7 years ago
- A python implementation of the Rapid Automatic Keyword Extraction☆975Updated 4 years ago
- A multilingual, cross-domain temporal tagger developed at the Database Systems Research Group at Heidelberg University.☆344Updated last year
- Data for Automatic Keyphrase Extraction Task☆336Updated 6 years ago
- DKPro JWPL (DKPro Java Wikipedia Library) is a free, Java-based application programming interface that facilitates access to all informat…☆83Updated 5 months ago
- TextRank implementation for Python 3.☆1,255Updated last year
- Word2Vec Java Port☆186Updated 6 years ago
- Quality information extraction at web scale. Edit☆327Updated 7 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json/avro dump☆253Updated last year
- Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.☆198Updated 4 months ago
- Multilingual text (NLP) processing toolkit☆2,330Updated last year
- Train a Word2Vec model or LSA model, and Implement Conceptual Search\Semantic Search in Solr\Lucene - Simon Hughes Dice.com, Dice Tech Jo…☆257Updated 5 years ago
- CRF++: Yet Another CRF toolkit☆506Updated last month