carrotsearch / langid-java
Java port of langid.py (language identifier)
☆28 Updated 11 years ago
Related projects
Alternatives and complementary repositories for langid-java
- Java text categorization system ☆54 Updated 7 years ago
- NLP framework for JVM languages. ☆148 Updated 3 years ago
- Various utilities regarding Levenshtein transducers. (Java) ☆56 Updated 2 years ago
- A language detection library for the JVM ☆36 Updated last year
- A fast and comprehensive Java library capable of performing automaton and non-automaton based Levenshtein distance determination and neig… ☆41 Updated 11 years ago
- NLP tools developed by Emory University. ☆60 Updated 8 years ago
- Apache Joshua ☆105 Updated 4 years ago
- Apache OpenNLP Sandbox ☆42 Updated this week
- Program used to split text into segments ☆25 Updated 3 weeks ago
- Machine learning components for Apache UIMA ☆129 Updated last year
- Automatically exported from code.google.com/p/berkeleylm ☆98 Updated 8 years ago
- A dependency tree visualizer for the Stanford Typed-Dependency Parser ☆68 Updated last week
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts ☆80 Updated 6 years ago
- Java interface for CRFsuite: http://www.chokkan.org/software/crfsuite/ ☆43 Updated 7 years ago
- Word2Vec Java Port ☆186 Updated 6 years ago
- My implementation of Explicit Semantic Analysis (ESA) library that we used at KMi, Open University to produce our submission at the NTCIR… ☆36 Updated 9 years ago
- Common web archive utility code. ☆50 Updated last month
- ☆184 Updated 6 years ago
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and … ☆48 Updated 3 years ago
- Thot toolkit for statistical machine translation ☆50 Updated 2 years ago
- Base modules of JCoRe ☆22 Updated 6 months ago
- A bundle of html content extraction algorithms ☆121 Updated 9 years ago
- Word and text similarity measures ☆54 Updated 2 years ago
- This is a Fact based Question Answering System using Apache Solr as backend search engine, Wikipedia dumps as information source, Apache … ☆25 Updated 2 years ago
- Java 8+ zero-dependency port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm ☆19 Updated 4 months ago
- NEWS: JATE2.0 Beta.11 Released, see details below. ☆81 Updated last year
- An efficient and flexible token-based regular expression language and engine. ☆75 Updated 10 years ago
- Partial Java port of the C++ OpenFST library ☆36 Updated 2 years ago
- BK-tree Java library ☆29 Updated 11 years ago
- Java interface for fastText ☆229 Updated last year