gurgeous / simhilarityLinks
Measure text similarity using weighted ngrams.
☆18Updated 11 years ago
Alternatives and similar repositories for simhilarity
Users that are interested in simhilarity are comparing it to the libraries listed below
Sorting:
- A ruby/c extension to Christian Borgelt's apriori item-set implementation☆55Updated 15 years ago
- This project is a Ruby gem ('hmm') for machine learning that natively implements a (somewhat) generalized Hidden Markov Model classifier.…☆26Updated 15 years ago
- Semanticizest: dump parser and client☆20Updated 9 years ago
- Ruby implementation of the PageRank and TextRank algorithms.☆75Updated 7 months ago
- Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.☆584Updated last year
- Compare image similarity with a dhash☆91Updated 3 years ago
- Plug & Play Anomaly Detection. Not maintained by the author at the moment. Feel free to fork or submit PRs.☆39Updated last year
- A document vector search with flexible matrix transforms. Currently supports Latent semantic analysis and Term frequency - inverse docume…☆149Updated 5 years ago
- Wikidata and Wikipedia API client.☆35Updated 2 years ago
- The Summarizer from the Web IR / NLP Group (WING), hence SWING, is a modular, state-of-the-art automatic extractive text summarization sy…☆38Updated 11 years ago
- A JRuby command line application and library for Apache Tika to extract text and metadata from files of various formats.☆54Updated 8 months ago
- Ruby Binding for Stanford Pos-Tagger and Name Entity Recognizer☆92Updated 11 years ago
- A dialog system framework for conversational services.☆61Updated 8 years ago
- Lemmatizer for text in English. Inspired by Python's nltk.corpus.reader.wordnet.morphy☆112Updated 4 years ago
- A sanitizing sandbox for executing Ruby code☆20Updated 13 years ago
- Simple Ruby client for Wikidata☆35Updated last year
- Ruby port of UEALite Stemmer - a conservative stemmer for search and indexing☆54Updated last week
- A pure Ruby implementation of the Aho-Corasick string matching algorithm☆34Updated 9 years ago
- A library for generating fake data such as names, addresses and much more.☆12Updated 6 years ago
- Wikipedia information extraction library☆175Updated last year
- A Ruby wrapper for Latent Dirichlet Allocation (LDA).☆134Updated 5 years ago
- Ruby wrapper for correcting spelling and grammar mistakes based on the context of complete sentences.☆477Updated 6 years ago
- Polipus: distributed and scalable web-crawler framework☆92Updated 10 years ago
- Expose libstemmer_c to Ruby☆250Updated 3 years ago
- Calculate similarity between documents using TF-IDF weights☆116Updated last year
- A scalable and shareable repository of text annotation☆34Updated last week
- A simple tokenizer in Ruby for NLP tasks.☆46Updated 8 years ago
- JRuby Mahout is a gem that unleashes the power of Apache Mahout in the world of JRuby.☆165Updated 10 years ago
- Machine learning and data mining algorithms for JRuby☆92Updated 8 years ago
- Implementation of the Rapid Automatic Keyword Extraction algorithm in Ruby, a multi-word keywords extraction.☆37Updated 12 years ago