spencermountain / wtf_wikipedia
a pretty-committed wikipedia markup parser
☆805Updated 2 months ago
Alternatives and similar repositories for wtf_wikipedia:
Users that are interested in wtf_wikipedia are comparing it to the libraries listed below
- roll a wikipedia dump into mongo☆243Updated 9 months ago
- Part-of-speech utilities for node.js based on the WordNet database.☆476Updated 2 years ago
- Wikipedia Interface for Node.js☆316Updated 7 months ago
- plugin to extract keywords and key-phrases☆333Updated 5 months ago
- WordNet Database files (previously WNdb)☆216Updated 5 years ago
- Automatically extract body content (and other cool stuff) from an html document☆2,156Updated last year
- Count syllables in an English word☆239Updated 2 years ago
- Node module that summarizes text using a naive summarization algorithm☆770Updated 6 months ago
- Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English.☆344Updated 6 years ago
- Sentence Boundary Detection in javascript for node. http://tessmore.github.io/sbd/☆211Updated last year
- An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship ty…☆92Updated 10 months ago
- MediaWiki API and WikiData client written in Node.js☆241Updated last week
- Generates a quiz for a Wikipedia page using parts of speech and text chunking.☆803Updated 4 years ago
- fasttag part of speech tagger javascript implementation☆279Updated 4 years ago
- A module for node.js and the browser that takes in text and strips it of stopwords☆245Updated 3 months ago
- ⚙️ [Processor] A better English POS tagger written in JavaScript☆54Updated 8 years ago
- 📝 Hunspell compatible spell-checker☆278Updated 4 years ago
- natural language processor powered by plugins part of the @unifiedjs collective☆2,396Updated 2 months ago
- Collaborative data curation for Glottolog☆160Updated last week
- World Factbook Country Profiles in JSON - Free Open Public Domain Data - No API Key Required ;-)☆1,026Updated last week
- Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.☆713Updated 9 months ago
- 🎀 JavaScript API for spaCy with Python REST API☆196Updated last year
- Gather modern English word frequencies from all enwiki articles.☆212Updated last year
- Filters a list based on a fuzzy string search☆835Updated 3 years ago
- The largest English-language thesaurus☆291Updated 2 years ago
- Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more.☆2,437Updated 2 weeks ago
- Analyse rhyme scheme, metre and form of poems☆130Updated 3 years ago
- Expose Spacy nlp text parsing to Nodejs (and other languages) via socketIO☆225Updated 2 years ago
- WordNet in JSON format.☆91Updated 4 years ago
- This is a mirror from https://gerrit.wikimedia.org/g/mediawiki/services/parsoid/. See https://www.mediawiki.org/wiki/Developer_access for…☆158Updated this week