spencermountain / wtf_wikipedia
a pretty-committed wikipedia markup parser
☆806Updated last week
Alternatives and similar repositories for wtf_wikipedia:
Users that are interested in wtf_wikipedia are comparing it to the libraries listed below
- roll a wikipedia dump into mongo☆242Updated 10 months ago
- Automatically extract body content (and other cool stuff) from an html document☆2,157Updated last year
- visualise readability☆211Updated 6 months ago
- Node module that summarizes text using a naive summarization algorithm☆770Updated 6 months ago
- 📚 Turn any web page into a clean view☆2,512Updated 4 years ago
- natural language processor powered by plugins part of the @unifiedjs collective☆2,398Updated 3 months ago
- A Wordnet API in pure JavaScript☆108Updated 2 years ago
- 🎀 JavaScript API for spaCy with Python REST API☆197Updated last year
- Filter and format a newline-delimited JSON stream of Wikibase entities☆97Updated 6 months ago
- Wikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis☆585Updated last year
- Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.☆714Updated 10 months ago
- An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship ty…☆93Updated 11 months ago
- ⚙️ [Processor] A better English POS tagger written in JavaScript☆54Updated 8 years ago
- Wiktionary dump file parser and multilingual data extractor☆900Updated last week
- Natural language detection☆4,248Updated 10 months ago
- Parse And Create Web ARChive (WARC) files with node.js☆98Updated 3 months ago
- Simple text proofreader based on 'write-good' (hemingway-app-like suggestions) and 'nodehun' (spelling).☆336Updated 7 years ago
- visualise sentence length☆241Updated 6 months ago
- A JSON representation of Webster's Unabridged Dictionary☆683Updated 4 years ago
- WordNet in JSON format.☆91Updated 4 years ago
- A Python parser for MediaWiki wikicode☆790Updated last month
- TextRank algorithm implementation in Javascript☆41Updated 10 years ago
- Fast HTML to markdown converter for NodeJS or the browser☆215Updated 9 months ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆99Updated last week
- Take the hassle out of web scraping☆466Updated 2 years ago
- A command-line tool for using CommonCrawl Index API at http://index.commoncrawl.org/☆189Updated 6 years ago
- 📕 Barebones boilerplate with Parcel 2, options handler and auto-publishing☆808Updated 3 months ago
- Lexical database of any language☆179Updated 2 years ago
- A Python library to parse MediaWiki WikiText☆307Updated 6 months ago
- A robust & multipurpose Graph object for JavaScript & TypeScript.☆1,437Updated last month