shuyo / language-detectionLinks
This is a language detection library implemented in plain Java. (aliases: language identification, language guessing)
☆753Updated 6 years ago
Alternatives and similar repositories for language-detection
Users that are interested in language-detection are comparing it to the libraries listed below
Sorting:
- Language Detection Library for Java☆578Updated 2 years ago
- Compact Language Detector 2☆863Updated 4 years ago
- Automatically exported from code.google.com/p/chromium-compact-language-detector☆162Updated 4 years ago
- Language Detection with Infinity-gram☆230Updated 9 years ago
- Apache OpenNLP☆1,513Updated last week
- Java interface for fastText☆237Updated last year
- Word2Vec Java Port☆186Updated 7 years ago
- Port of Google's language-detection library to Python.☆1,804Updated 3 months ago
- Stand-alone language identification system☆2,386Updated 5 years ago
- ☆828Updated 2 years ago
- MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, informat…☆1,003Updated last year
- extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format.☆128Updated last year
- All languages stopwords collection☆445Updated last year
- DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text.☆758Updated 7 years ago
- Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary.☆192Updated last year
- CMU ARK Twitter Part-of-Speech Tagger☆575Updated last year
- Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.☆198Updated 6 months ago
- ☆184Updated 6 years ago
- Automatically exported from code.google.com/p/universal-pos-tags☆129Updated 2 years ago
- Software and resources for natural language processing.☆131Updated 8 years ago
- A large-scale statistical machine translation system written in Java.☆209Updated 3 years ago
- Twitter NLP Tools☆889Updated 2 years ago
- SemanticVectors creates semantic WordSpace models from free natural language text.☆217Updated 2 years ago
- An extremely fast implementation of Aho Corasick algorithm based on Double Array Trie.☆981Updated 3 years ago
- This tool extracts word vectors from Lucene index.☆135Updated 7 years ago
- Apache Joshua☆107Updated 4 years ago
- Heuristic based boilerplate removal tool☆780Updated 3 months ago
- SymSpell v6.4ish ported to Java 8. Will be a module in my Master Thesis.☆24Updated 5 years ago
- Just the facts -- web page content extraction☆1,266Updated 11 months ago
- Java port of langid.py (language identifier)☆28Updated 12 years ago