spencermountain / dumpster-dive
roll a wikipedia dump into mongo
☆240Updated 2 months ago
Related projects: ⓘ
- a pretty-committed wikipedia markup parser☆770Updated 2 months ago
- 🎀 JavaScript API for spaCy with Python REST API☆193Updated last year
- Expose Spacy nlp text parsing to Nodejs (and other languages) via socketIO☆225Updated last year
- spaCy REST API, wrapped in a Docker container.☆264Updated last year
- FastText for Node.js☆192Updated last year
- command-line tool to extract taxonomies from Wikidata☆124Updated 5 years ago
- varied english texts for modern NLP testing☆73Updated 2 years ago
- English NLP for Node.js and the browser.☆86Updated 10 months ago
- tools for working with Princeton's lexical database WordNet☆74Updated 6 years ago
- Sentence Boundary Detection in javascript for node. http://tessmore.github.io/sbd/☆204Updated 11 months ago
- JS utils functions to query a Wikibase instance and simplify its results☆324Updated last month
- A module for node.js and the browser that takes in text and strips it of stopwords☆230Updated this week
- Multilingual tokenizer that automatically tags each token with its type☆59Updated last year
- LDA topic modeling for node.js☆291Updated last month
- CLDR text segmentation for JavaScript☆38Updated 4 months ago
- text mining utilities for Node.js☆141Updated last year
- WordNet Database files (previously WNdb)☆215Updated 4 years ago
- A Wordnet API in pure JavaScript☆106Updated last year
- WordNet in JSON format.☆90Updated 4 years ago
- displaCy.js: An open-source NLP visualiser for the modern web☆342Updated 6 years ago
- fasttag part of speech tagger javascript implementation☆279Updated 4 years ago
- TextRank algorithm implementation in Javascript☆40Updated 9 years ago
- Filter and format a newline-delimited JSON stream of Wikibase entities☆98Updated 2 months ago
- displaCy-ent.js: An open-source named entity visualiser for the modern web☆198Updated 6 years ago
- ⚙️ [Processor] A better English POS tagger written in JavaScript