eklem / stopword-trainer
A module for creating stopword lists for any language, based on a set of documents.
☆14Updated last month
Related projects
Alternatives and complementary repositories for stopword-trainer
- Minimalistic trie implementation for prefix searches☆13Updated 7 years ago
- A semi-unsupervised language independent morphological analyzer useful for stemming unknown language text, or getting a rough estimate of…☆21Updated 6 years ago
- This stemming module for Node.js provides stemming capability for a variety of languages using Dr. M.F. Porter's Snowball API.☆50Updated 8 months ago
- Node.js wrapper for Wikipedia API☆45Updated 6 years ago
- text mining utilities for Node.js☆143Updated last year
- Client for Stanford Named Entity Recognition☆27Updated 6 years ago
- English NLP for Node.js and the browser.☆86Updated last year
- A module for node.js and the browser that takes in text and strips it of stopwords☆231Updated last month
- Tokenize paragraphs into sentences, and smaller tokens.☆48Updated last year
- generate rules from lists of words☆16Updated 3 years ago
- Language agnostic named entity recognizer☆39Updated last year
- A node.js module that creates a term vector from a mixed text input. Supports stopword removal and customisable separators.☆19Updated 5 years ago
- bag-of-words calculator in javascript☆136Updated 4 years ago
- Throw JavaScript objects at the index and they will become retrievable by their properties using promises and map-reduce☆19Updated 2 months ago
- A lightweight JavaScript client library for the Wikimedia Pageviews API for Wikipedia and various of its sister projects for Node.js and …☆26Updated 3 years ago
- NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more.☆123Updated 8 months ago
- Vanilla JavaScript implementation of the Weighted PageRank Algorithm☆30Updated 5 years ago
- Deprecated plugin to detect sentiment: use `words/polarity`☆97Updated 2 weeks ago
- Node wrapper around FastText Library☆57Updated last year
- tools for working with Princeton's lexical database WordNet☆74Updated 6 years ago
- Predictive text in JavaScript☆29Updated 12 years ago
- List of (possible) English hedge words☆44Updated 2 years ago
- List of easy American-English words: The New Dale-Chall (1995)☆32Updated 2 years ago
- Nodejs binding for fasttext representation and classification.☆43Updated 8 months ago
- ☆33Updated 11 years ago
- Node.js wrapper for the DuckDuckGo Instant Answers API.☆64Updated last year
- Fast Porter stemmer implementation☆129Updated 2 years ago
- 🤬 Map of profane words to a rating of sureness☆168Updated last year
- Get all email addresses in a string☆58Updated 3 years ago
- JavaScript implementation of Frank Denis' (@jedisct1) minisign tool.☆83Updated last year