duhaime / minhash
Quickly estimate the similarity between many sets
☆52Updated 2 years ago
Alternatives and similar repositories for minhash:
Users that are interested in minhash are comparing it to the libraries listed below
- Distance/Similarity functions for Bag of Words, Strings, Vectors and more.☆24Updated last year
- Multilingual tokenizer that automatically tags each token with its type☆61Updated 2 years ago
- Nodejs binding for fasttext representation and classification.☆43Updated last year
- PageRank calculation for ngraph.graph☆28Updated 2 months ago
- Node wrapper around FastText Library☆57Updated 2 years ago
- This stemmming module for Node.js provides stemming capability for a variety of languages using Dr. M.F. Porter's Snowball API.☆51Updated last month
- Simhash implementation in Javascript☆38Updated 7 years ago
- node module for geospatial indexing with leveldb☆35Updated 2 years ago
- Fast Full Text Search based on BM25☆63Updated 2 years ago
- an opinionated assembly of wordnet for javascript☆56Updated 8 years ago
- A Canvas-based pHash Implementation☆12Updated 8 months ago
- Tokenize paragraphs into sentences, and smaller tokens.☆48Updated last year
- Nodejs module for Extracting Concepts from text.☆10Updated last year
- Image perceptual hash calculation in javascript☆172Updated 4 years ago
- Throw JavaScript objects at the index and they will become retrievable by their properties using promises and map-reduce☆19Updated 2 months ago
- Vanilla JavaScript implementation of the Weighted PageRank Algorithm☆34Updated 5 years ago
- Accurate and fast sentiment scoring of phrases with #hashtags, emoticons :) & emojis 🎉☆62Updated 2 years ago
- A NodeJS implementation of the Rapid Automatic Keyword Extraction algorithm.☆103Updated last year
- A list of all SciJS packages. Based on @hughsk's stack.gl/packages☆25Updated 9 years ago
- Tunable full text search engine in JavaScript that: (1) works natively on web apps like Express.js; (2) easy to customize (via BM25) to s…☆34Updated 6 years ago
- tools for working with Princeton's lexical database WordNet☆73Updated 6 years ago
- generate rules from lists of words☆16Updated 3 years ago
- Fastest way to fetch the web content(HTML stream) from server, supports:redirects, auto decode(e.g.:Chinese), gzip, cookie, proxy...☆33Updated 4 years ago
- k-means clustering algorithm with k-means++ initialization.☆32Updated 2 years ago
- A node.js module that creates a term vector from a mixed text input. Supports stopword removal and customisable separators.☆19Updated 5 months ago
- Fast & numerically stable statistical analysis☆46Updated 2 years ago
- Locality-Sensitive Hashing implementation in node.js for fast and scalable approximate nearest neighbors search☆11Updated 6 years ago
- Language detection for Javascript (Node). Based on the CLD2 (Compact Language Detector) library from Google.☆326Updated 2 months ago
- SVM Classifier to Detect Sentiment of Tweets☆16Updated 10 years ago
- ☆11Updated 6 years ago