winkjs / wink-tokenizer
Multilingual tokenizer that automatically tags each token with its type
☆61Updated 2 years ago
Alternatives and similar repositories for wink-tokenizer:
Users that are interested in wink-tokenizer are comparing it to the libraries listed below
- Language agnostic named entity recognizer☆39Updated 2 years ago
- Naive Bayes Text Classifier☆40Updated 2 months ago
- NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more.☆127Updated last year
- Distance/Similarity functions for Bag of Words, Strings, Vectors and more.☆23Updated last year
- Fast Full Text Search based on BM25☆63Updated 2 years ago
- Accurate and fast sentiment scoring of phrases with #hashtags, emoticons :) & emojis 🎉☆62Updated 2 years ago
- Nodejs binding for fasttext representation and classification.☆43Updated last year
- English lexicon useful in NLP/NLU☆15Updated last year
- CLDR text segmentation for JavaScript☆38Updated last year
- Vanilla JavaScript implementation of the Weighted PageRank Algorithm☆34Updated 5 years ago
- tools for working with Princeton's lexical database WordNet☆73Updated 6 years ago
- An Implementation of Jaro Distance Algorithm by Matthew A. Jaro☆13Updated 3 years ago
- Javascript Implementation of Porter Stemmer Algorithm V2 by Dr Martin F Porter☆20Updated 2 years ago
- Tokenize paragraphs into sentences, and smaller tokens.☆48Updated last year
- Node bindings for Annoy, an efficient Approximate Nearest Neighbors implementation written in C++.☆80Updated last year
- PageRank calculation for ngraph.graph☆28Updated last month
- an opinionated assembly of wordnet for javascript☆56Updated 8 years ago
- Fast & numerically stable statistical analysis☆46Updated 2 years ago
- ⚙️ [Processor] A better English POS tagger written in JavaScript☆54Updated 8 years ago
- A suite of modules for text analysis, including simple analysis, nGrams, and TFIDF analysis☆48Updated 4 years ago
- Decision Tree to predict the value of a continuous target variable☆16Updated 2 years ago
- English lemmatizer☆66Updated last year
- A semi-unsupervised language independent morphological analyzer useful for stemming unknown language text, or getting a rough estimate of…☆21Updated 7 years ago
- English Part-of-speech (POS) tagger☆66Updated 2 years ago
- Tool for grouping similar items.☆24Updated 2 months ago
- generate rules from lists of words☆16Updated 3 years ago
- Multi-class classifier☆13Updated 2 years ago
- English NLP for Node.js and the browser.☆87Updated last year
- FastText for Node.js☆196Updated 2 years ago
- HTML5 Canvas implementation for NodeJS backed by Puppeteer☆65Updated last year