uttesh / exudeLinks
Simple java library to filter the stopping,stemming words from input data or file and link
☆22Updated 6 years ago
Alternatives and similar repositories for exude
Users that are interested in exude are comparing it to the libraries listed below
Sorting:
- Java text categorization system☆56Updated 8 years ago
- The Common Crawl Crawler Engine and Related MapReduce code (2008-2012)☆217Updated 2 years ago
- Code examples on how to use the Datumbox Machine Learning Framework.☆41Updated last year
- A Query Autofiltering SearchComponent for Solr that can translate free-text queries into structured queries using index metadata☆27Updated 6 years ago
- Java Execution Time Measurement Library☆59Updated last year
- Distributed Realtime Search with Lucene and MongoDB☆59Updated 7 years ago
- A java classifier based on the naive Bayes approach complete with Maven support and a runnable example.☆298Updated 4 years ago
- SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm☆18Updated 10 years ago
- A new object-graph-wrapper for the Tinkerpop 3 graph stack.☆40Updated 4 years ago
- A Maven plugin to run a single node Elasticsearch cluster during the integration test phase of a build☆89Updated last month
- A language detection library for the JVM☆36Updated last year
- Bloom filters for Java☆65Updated last year
- JAVA implementation of Multinomial Naive Bayes Text Classifier.☆95Updated 10 years ago
- Java library for parsing semi-structured text files☆65Updated 3 years ago
- A lightweight Java library for detecting mobile devices.☆24Updated 8 years ago
- Async HTTP server/client - high performance in functional style, full-featured.☆39Updated 9 years ago
- WARC (Web Archive) Input and Output Formats for Hadoop☆36Updated 10 years ago
- A lightweight and easy to use full text search implementation for Java. Uses inverted index and cosine similarity w/ TFIDF ranking.☆52Updated 7 years ago
- The FFPOJO Project is a Flat-File Parser, POJO based, library for Java applications.☆68Updated last year
- A model-view based code generator written in Java☆40Updated 8 years ago
- Ultra-Light JDBC Persistance Layer☆135Updated 4 months ago
- A high performance "thin wrapper" HTTP REST server on top of Apache Lucene☆143Updated last year
- Java Nio FileSystem for accessing github☆45Updated 10 years ago
- An ORM / OGM for the TinkerPop graph stack.☆137Updated 3 years ago
- A set of reusable Java components that implement functionality common to any web crawler☆244Updated 2 weeks ago
- Spark Tutorial Collection☆94Updated 2 years ago
- YCB Java☆27Updated 2 years ago
- Library for building efficient regular-expression based extractors by combining multiple REs into single automaton☆24Updated 3 years ago
- A small program to detect gibberish using a Markov Chain☆27Updated 6 years ago
- Write parsers for arbitrary text inputs, entirely in Java, with no preprocessing phase☆65Updated 9 years ago