spencermountain / wtf_wikipedia
a pretty-committed wikipedia markup parser
☆793 · Updated 2 weeks ago
Alternatives and similar repositories for wtf_wikipedia:
Users who are interested in wtf_wikipedia compare it to the libraries listed below.
- roll a wikipedia dump into mongo ☆241 · Updated 7 months ago
- Part-of-speech utilities for node.js based on the WordNet database. ☆473 · Updated 2 years ago
- LDA topic modeling for node.js ☆294 · Updated 6 months ago
- 🎀 JavaScript API for spaCy with Python REST API ☆196 · Updated last year
- MediaWiki API and WikiData client written in Node.js ☆240 · Updated last week
- Wikipedia Interface for Node.js ☆316 · Updated 5 months ago
- A module for node.js and the browser that takes in text and strips it of stopwords ☆240 · Updated last month
- fasttag part of speech tagger javascript implementation ☆279 · Updated 4 years ago
- A Python parser for MediaWiki wikicode ☆779 · Updated last month
- This is a mirror from https://gerrit.wikimedia.org/g/mediawiki/services/parsoid/. See https://www.mediawiki.org/wiki/Developer_access for… ☆157 · Updated this week
- Expose Spacy nlp text parsing to Nodejs (and other languages) via socketIO ☆225 · Updated 2 years ago
- Language detection for Javascript (Node). Based on the CLD2 (Compact Language Detector) library from Google. ☆321 · Updated 5 months ago
- Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript. ☆708 · Updated 7 months ago
- A client for the Stanford Part of Speech Tagger XMLRPC server. ☆72 · Updated 7 years ago
- plugin to extract keywords and key-phrases ☆332 · Updated 3 months ago
- A Wordnet API in pure JavaScript ☆108 · Updated 2 years ago
- Node module that summarizes text using a naive summarization algorithm ☆770 · Updated 4 months ago
- Machine Learning, Natural Language Processing and Sentiment Analysis Toolkit for Node.js ☆240 · Updated 8 years ago
- A Python library to parse MediaWiki WikiText ☆299 · Updated 4 months ago
- Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English. ☆343 · Updated 6 years ago
- text mining utilities for Node.js ☆141 · Updated 2 years ago
- Sentence Boundary Detection in javascript for node. http://tessmore.github.io/sbd/ ☆209 · Updated last year
- Node.js interface to the Google word2vec tool. ☆352 · Updated 6 months ago
- natural language processor powered by plugins part of the @unifiedjs collective ☆2,375 · Updated 2 weeks ago
- English NLP for Node.js and the browser. ☆89 · Updated last year
- JavaScript MediaWiki API for node.js ☆50 · Updated 10 months ago
- Automatically extract body content (and other cool stuff) from an html document ☆2,154 · Updated last year
- A command-line tool for using CommonCrawl Index API at http://index.commoncrawl.org/ ☆188 · Updated 6 years ago
- WordNet Database files (previously WNdb) ☆215 · Updated 5 years ago
- read and edit a Wikibase instance from the command line ☆230 · Updated this week