This is a language detection library implemented in plain Java. (aliases: language identification, language guessing)
☆763 · Feb 25, 2019 · Updated 7 years ago
Alternatives and similar repositories for language-detection
Users who are interested in language-detection are comparing it to the libraries listed below.
- Language Detection Library for Java ☆586 · Jul 23, 2022 · Updated 3 years ago
- Compact Language Detector 2 ☆894 · May 22, 2021 · Updated 4 years ago
- Language Detection with Infinity-gram ☆230 · Jul 9, 2015 · Updated 10 years ago
- Stand-alone language identification system ☆2,452 · Jan 1, 2020 · Updated 6 years ago
- A language detection Web Service ☆53 · May 9, 2017 · Updated 8 years ago
- A plugin for language detection in Elasticsearch using Nakatani Shuyo's language detector ☆253 · Dec 12, 2017 · Updated 8 years ago
- ☆873 · May 24, 2023 · Updated 2 years ago
- A language detection library for the JVM ☆36 · Aug 21, 2023 · Updated 2 years ago
- Automatically exported from code.google.com/p/chromium-compact-language-detector ☆161 · Oct 1, 2020 · Updated 5 years ago
- Library for fast text representation and classification. ☆26,501 · Mar 22, 2024 · Updated last year
- ☆16 · Sep 6, 2017 · Updated 8 years ago
- The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike ☆800 · Mar 21, 2025 · Updated 11 months ago
- CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc. ☆10,059 · Feb 10, 2026 · Updated 3 weeks ago
- Arabic named entity recognition using AnerCorp corpus (location, organisation, person, Miscellaneous Word) ☆37 · Jul 28, 2017 · Updated 8 years ago
- Creates a Lucene index out of files from a local folder ☆13 · Aug 8, 2014 · Updated 11 years ago
- ☆178 · Mar 28, 2025 · Updated 11 months ago
- This plugin provides a useful feature for multi-language ☆14 · Jul 15, 2022 · Updated 3 years ago
- Plugin to integrate Learning to Rank (aka machine learning for better relevance) with Elasticsearch ☆1,525 · Feb 19, 2026 · Updated 2 weeks ago
- Work in progress transmit from Google Code ☆1,128 · Jan 3, 2018 · Updated 8 years ago
- GSoC'16 RedHen Labs ☆11 · Aug 22, 2016 · Updated 9 years ago
- A PL/Java Wrapper on Ark-Tweet-NLP (http://www.ark.cs.cmu.edu/TweetNLP/) - Twitter Parts-of-speech tagger in Postgres/Greenplum ☆17 · Jul 25, 2014 · Updated 11 years ago
- Multilingual text (NLP) processing toolkit ☆2,366 · Nov 10, 2023 · Updated 2 years ago
- A tool for extracting plain text from Wikipedia dumps ☆3,971 · May 23, 2024 · Updated last year
- Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm ☆67 · Feb 10, 2026 · Updated 3 weeks ago
- View functions specs at your browser ☆15 · Feb 15, 2018 · Updated 8 years ago
- A friendly Clojurescript toolkit ☆13 · Dec 3, 2022 · Updated 3 years ago
- Polyglot is a language identifier for detecting text documents containing text written in more than one language, and for identifying the… ☆32 · Jun 28, 2016 · Updated 9 years ago
- Dice.com tutorial on using black box optimization algorithms to do relevancy tuning on your Solr Search Engine Configuration from Simon H… ☆29 · Mar 20, 2019 · Updated 6 years ago
- Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and … ☆14,206 · Updated this week
- A new solr multilingual index and search architecture, it can support index and search across multiple languages at the same time in the … ☆13 · Oct 18, 2019 · Updated 6 years ago
- Distributed processing framework for search solutions ☆82 · Dec 16, 2022 · Updated 3 years ago
- A large-scale statistical machine translation system written in Java. ☆213 · Dec 12, 2021 · Updated 4 years ago
- 💫 Industrial-strength Natural Language Processing (NLP) in Python ☆33,283 · Updated this week
- NLP tools developed by Emory University. ☆61 · Jul 30, 2016 · Updated 9 years ago
- ☆16 · Aug 30, 2017 · Updated 8 years ago
- Relay webhooks to Slack webhooks ☆21 · Jul 19, 2017 · Updated 8 years ago
- An example full stack Clojure & Clojurescript project for Google App Engine Standard (Java). ☆16 · Mar 17, 2019 · Updated 6 years ago
- SemanticVectors creates semantic WordSpace models from free natural language text. ☆221 · Sep 21, 2022 · Updated 3 years ago
- Lucene Auto Phrase TokenFilter implementation ☆59 · Jul 11, 2018 · Updated 7 years ago