duhaime / minhash
Quickly estimate the similarity between many sets
☆53 · Updated 2 years ago
Alternatives and similar repositories for minhash
Users who are interested in minhash are comparing it to the libraries listed below
- Multilingual tokenizer that automatically tags each token with its type ☆62 · Updated 2 years ago
- Distance/Similarity functions for Bag of Words, Strings, Vectors and more. ☆24 · Updated 2 years ago
- Nodejs binding for fasttext representation and classification. ☆43 · Updated last year
- A NodeJS implementation of the Rapid Automatic Keyword Extraction algorithm. ☆104 · Updated 2 years ago
- Similarity identification in JS ☆37 · Updated last year
- Fast Full Text Search based on BM25 ☆66 · Updated 2 years ago
- CoreNLP @ NodeJS ☆66 · Updated 2 years ago
- Language detection for Javascript (Node). Based on the CLD2 (Compact Language Detector) library from Google. ☆332 · Updated 7 months ago
- This stemming module for Node.js provides stemming capability for a variety of languages using Dr. M.F. Porter's Snowball API. ☆52 · Updated 6 months ago
- Node bindings for Annoy, an efficient Approximate Nearest Neighbors implementation written in C++. ☆82 · Updated 2 years ago
- Image perceptual hash calculation in javascript ☆174 · Updated 5 years ago
- English NLP for Node.js and the browser. ☆86 · Updated last year
- bag-of-words calculator in javascript ☆135 · Updated 5 years ago
- Throw JavaScript objects at the index and they will become retrievable by their properties using promises and map-reduce ☆19 · Updated 2 months ago
- LDA topic modeling for node.js ☆298 · Updated last year
- NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more. ☆131 · Updated last year
- Simhash implementation in Javascript ☆39 · Updated 8 years ago
- text mining utilities for Node.js ☆142 · Updated 2 years ago
- Accurate and fast sentiment scoring of phrases with #hashtags, emoticons :) & emojis 🎉 ☆62 · Updated 2 years ago
- PageRank calculation for ngraph.graph ☆29 · Updated last week
- an opinionated assembly of wordnet for javascript ☆56 · Updated 8 years ago
- Node wrapper around FastText Library ☆57 · Updated 2 years ago
- Language agnostic named entity recognizer ☆39 · Updated 2 years ago
- Apache Tika bridge for Node.js. Text and metadata extraction, language detection and more. ☆142 · Updated last year
- Automatically extracts structured information from webpages ☆109 · Updated 3 years ago
- Machine Learning, Natural Language Processing and Sentiment Analysis Toolkit for Node.js ☆241 · Updated 9 years ago
- Tokenize paragraphs into sentences, and smaller tokens. ☆48 · Updated 2 years ago
- WordNet Database files (previously WNdb) ☆218 · Updated 5 years ago
- neato compression for key-value data ☆107 · Updated last year
- Principal component analysis ☆100 · Updated 11 months ago