gurgeous / simhilarity
Measure text similarity using weighted ngrams.
☆18Updated 11 years ago
Alternatives and similar repositories for simhilarity
Users that are interested in simhilarity are comparing it to the libraries listed below
Sorting:
- Plug & Play Anomaly Detection. Not maintained by the author at the moment. Feel free to fork or submit PRs.☆39Updated last year
- A JRuby command line application and library for Apache Tika to extract text and metadata from files of various formats.☆53Updated 2 weeks ago
- A ruby/c extension to Christian Borgelt's apriori item-set implementation☆55Updated 15 years ago
- Ruby implementation of the PageRank and TextRank algorithms.☆75Updated this week
- Wikidata and Wikipedia API client.☆35Updated last year
- This project is a Ruby gem ('hmm') for machine learning that natively implements a (somewhat) generalized Hidden Markov Model classifier.…☆26Updated 15 years ago
- Launch AWS Elastic MapReduce jobs that process Common Crawl data.☆49Updated 8 years ago
- A document vector search with flexible matrix transforms. Currently supports Latent semantic analysis and Term frequency - inverse docume…☆150Updated 4 years ago
- An implementation of the MinHash algorithm in ruby using Murmur Hash☆25Updated 16 years ago
- Easy autocomplete: redis, ruby,☆73Updated 5 months ago
- Ruby port of UEALite Stemmer - a conservative stemmer for search and indexing☆54Updated 2 years ago
- A pure Ruby implementation of the Aho-Corasick string matching algorithm☆34Updated 8 years ago
- Simple Ruby client for Wikidata☆35Updated last year
- Ruby gem to semi-automatically redact confidential information from a text☆14Updated 8 years ago
- Namae (名前) parses personal names and splits them into their component parts.☆163Updated last year
- Polipus: distributed and scalable web-crawler framework☆92Updated 9 years ago
- JRuby Mahout is a gem that unleashes the power of Apache Mahout in the world of JRuby.☆165Updated 9 years ago
- A fast and accurate rule-based sentence segmentation tool for Ruby.☆51Updated this week
- Compare image similarity with a dhash☆93Updated 2 years ago
- A Ruby wrapper for Latent Dirichlet Allocation (LDA).☆133Updated 4 years ago
- A redis-backed Bayesian classifier☆38Updated 9 years ago
- Machine learning and data mining algorithms for JRuby☆92Updated 8 years ago
- Semanticizest: dump parser and client☆20Updated 9 years ago
- Locality Sensitive Hashing in Ruby☆33Updated 11 years ago
- Google Protocol Buffers integration for Active Record.☆58Updated last month
- Simple random number generator gem for Ruby (based on C# code by John D. Cook).☆37Updated 6 years ago
- Implementation of the Rapid Automatic Keyword Extraction algorithm in Ruby, a multi-word keywords extraction.☆37Updated 11 years ago
- annoy-rb provides Ruby bindings for the Annoy (Approximate Nearest Neighbors Oh Yeah).☆35Updated 4 months ago
- Lemmatizer for text in English. Inspired by Python's nltk.corpus.reader.wordnet.morphy☆108Updated 3 years ago
- A pure Ruby interface to the WordNet database☆90Updated 5 years ago