raymanrt / aho-corasick
Aho-Corasick algorithm as implemented in Java by Danny Yoo, with little improvements
☆26Updated 10 years ago
Related projects: ⓘ
- A language detection library for the JVM☆35Updated last year
- Classifier4J is a Java library designed to do text classification. It comes with an implementation of a Bayesian classifier, and now has …☆11Updated 8 years ago
- A port of the arclabs 'readability' package to Java☆72Updated 12 years ago
- BK-tree Java library☆28Updated 10 years ago
- This is my main Java library for all kinds of datastructures, algorithms and everything else that I need.☆73Updated last year
- The Common Crawl Crawler Engine and Related MapReduce code (2008-2012)☆214Updated last year
- Java implementation of famous fuzzy wuzzy algorithm -- http://seatgeek.com/blog/dev/fuzzywuzzy-fuzzy-string-matching-in-python☆15Updated 8 years ago
- Java text categorization system☆54Updated 7 years ago
- JAVA implementation of Multinomial Naive Bayes Text Classifier.☆95Updated 9 years ago
- Examples of how to use many different threading operators in RxJava☆27Updated 10 years ago
- # Vert.x 2.x is **deprecated** - use instead☆110Updated 4 years ago
- Word2Vec Java Port☆186Updated 6 years ago
- A bundle of html content extraction algorithms☆122Updated 9 years ago
- A generic Tf-Idf utility with example code that works on n-grams extracted from a text document.☆23Updated 10 years ago
- Mensa is a generic, flexible, enhanced, and efficient Java implementation of a pattern matching state machine as described by the 1975 pa…☆94Updated 9 years ago
- Practical Algorithm to Retrieve Information Coded in Alphanumeric (PATRICIA)☆176Updated 5 years ago
- Machine learning components for Apache UIMA☆129Updated last year
- Gradle project producing two jars from single source directory☆14Updated 9 years ago
- Mavenized version of Kelvin Tan's example (http://www.lucenetutorial.com/lucene-in-5-minutes.html)☆69Updated last month
- Java port of langid.py (language identifier)☆28Updated 11 years ago
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆58Updated 11 years ago
- PredictionIO Java SDK☆106Updated 6 years ago
- A Gradle plugin for the Java annotation processor tool☆47Updated 9 years ago
- RxFsm is a hierarchical finite state machine (FSM) library built on top of RxJava.☆38Updated 8 years ago
- A Java implementation of a Double Array Trie☆122Updated 13 years ago
- Small useful things for Java☆130Updated 5 years ago
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and …☆48Updated 2 years ago
- Educational Examle of a custom Lucene Query & Scorer☆48Updated 4 years ago
- Integration between Stanford NLP and Apache Stanbol☆33Updated 8 years ago
- This provides tools for b-bit MinHash algorism.☆33Updated 8 months ago