spencermountain / dumpster-diveLinks
roll a wikipedia dump into mongo
☆248Updated last year
Alternatives and similar repositories for dumpster-dive
Users that are interested in dumpster-dive are comparing it to the libraries listed below
Sorting:
- a pretty-committed wikipedia markup parser☆841Updated 3 months ago
- 🎀 JavaScript API for spaCy with Python REST API☆197Updated 2 years ago
- Expose Spacy nlp text parsing to Nodejs (and other languages) via socketIO☆227Updated 2 years ago
- ⚙️ [Processor] A better English POS tagger written in JavaScript☆56Updated 8 years ago
- Multilingual tokenizer that automatically tags each token with its type☆62Updated 2 years ago
- JS utils functions to query a Wikibase instance and simplify its results☆337Updated 3 weeks ago
- varied english texts for modern NLP testing☆78Updated 3 years ago
- LDA topic modeling for node.js☆297Updated last year
- English NLP for Node.js and the browser.☆86Updated 2 years ago
- text mining utilities for Node.js☆142Updated 2 years ago
- FastText for Node.js☆198Updated 2 years ago
- NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more.☆131Updated last year
- plugin to extract keywords and key-phrases☆334Updated last year
- tools for working with Princeton's lexical database WordNet☆73Updated 7 years ago
- command-line tool to extract taxonomies from Wikidata☆128Updated 6 years ago
- Word embeddings for the web☆28Updated 2 years ago
- Get n-grams from text☆83Updated 2 years ago
- fasttag part of speech tagger javascript implementation☆281Updated 5 years ago
- Filter and format a newline-delimited JSON stream of Wikibase entities☆104Updated last month
- List of emoji rated for valence☆123Updated 2 years ago
- One trick pony NLP library for extracting keywords from HTML documents☆18Updated 9 years ago
- WordNet in JSON format.☆93Updated 5 years ago
- A Wordnet API in pure JavaScript☆109Updated 2 years ago
- CoreNLP @ NodeJS☆66Updated 2 years ago
- A client for the Stanford Part of Speech Tagger XMLRPC server.☆72Updated 8 years ago
- an opinionated assembly of wordnet for javascript☆56Updated 8 years ago
- TextRank algorithm implementation in Javascript☆40Updated 10 years ago
- spaCy REST API, wrapped in a Docker container.☆267Updated 2 years ago
- Node.js interface to the Google word2vec tool.☆355Updated last year
- A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.☆98Updated 3 years ago